FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACACTCCACCATATAGACCCGGCT T TACCT TATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
f CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 

IN 

Ill TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
=S TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 

** »* •* 

LlJ ATCAGTTT TGCACAGT GGCAAT TTTGGGACTGAT TTAGAACAGAAACTCCAT TGGAACCCCGAGG 

ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
f{ GGTTTTTTCCT GAC T T AC AT TCGTGATTTT CAGAAAAT T T CT T T AC GGG T GGAAG CCAAT T TACA 

TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
i*\ GAGATAT T TGAT GAAAGGAT AAAATAT T T C TGTAATGAT TAT GATTC T CAGG GAT TGGGGAAAGG 

If] TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
CI GAT GAAT GCT GATAAT CAGGAAACAT GAAAGAAGCGAT T T GATAGAT TAT T C TAAAGGATAT CAT 

«•«»■■ 

^ h «- 

CAAGAAGACTAT TAAAAACACCTATGCCTATACT TT T TTATCTCAGAAAATAAAGTCAAAAGACT 

# i' 

ATG 
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FIGURE 2 



<subunit 1 of 1, 2 66 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

M W W FQQGLSFLP S AL V I W T S AAF I FS Y I T AVT L HH I D P AL P Y I S D T G T VAPE KCL FGAMLN I AAV 
LC I AT I YVRYKQVHALS PEENVI I KLNKAGLVLG I LS CLGLS I VANFQKT TL FAAHVS GAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 



Important features : 



Type II transmembrane domain: 



amino acids 13-33 




Other Transmembrane domains: 



amino acids 54-73, 94-113, 160-180, 122-141 



N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



ft ft 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCGCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
, CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
r-l CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
rl TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
IP C TCAGGAATGT GACTGACAC GGATATT CTAGCC CTGGAACGGCGACTGCTGCAAACCAT GGATAT GAT 

CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Ul GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
Q GAAGT TTT GGT CCCAACACAT TT CCT T CATTCTTGT T GGAATAATCAT CGT CACAT CCATCAGAGGAT 

JJJ TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTT T TCACCT T CATAGCATACT CCT T CCCCGTCAGGTGATACTATGACCATGAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAA.TACACTGGAACTCTGGGGCAAGACATGT 
CTAT GGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGT TCACAT GGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE^ 

MS FL I DS S IMI T SQI LFFG FGWLFFMRQL FKDYE IRQYWQVI FSVT FAFS CTMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRMVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDVKFWSQHISFILVGI IIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLL IRMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALS S I LFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACT CT CCATCC GGAC TAGT TAT T GAGCATC T GCC T CT CATATCAC CAG T GGC CATC TGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTGCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TAT CACCACCAAAGAT CCCATAT TCAACACT CAAACT GCAACACAAACAACAGAAT T TAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
C T GC T CCAGCT TCCACT T C TAT TCCAC GGAGAAAAAAAT T GAT T T GT GT CACAGAAGT T T T TAT G 
GAAAC TAGCACCATGTCTACAGAAACT GAACCAT TTGTT GAAAATAAAGCAGCAT TCAAGAATGA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGAT CGAAACCAAAGTAGTAAAGGAGGAGAAGGC CAAT GATAGCAACCC TAA 
T GAGGAAT C AAAG AAAAC T GAT AAAAAC C CAGAAGAG T C CAAGAG TC CAAG CAAAAC T AC C G T GC 
GATGCCT GGAAGC T GAAGT T TAGATGAGACAGAAAT GAGGAGACACACC T GAGGC TGGTTTCTTT 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCC^VAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCC TAAC T GGAATCAGC TCAG GAC TGCCAT TGGAC TATGGAGT GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GC T C T G AAAG AGAAAC AC G TAT C C C AC C T GAC AT GTCCTTCT GAGC C C G G T AAG AGC AAAAG AAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGA.CAGTACACTGT CAGCAGGGACT GTAAAC 
ACAGACAGGGT CAAAGT GT T T TC TC TGAACACAT TGAGT T GGAATCAC TGT T TAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAAC TC T TATAAAT TTCTAT T T T TATCTGAGT TACAGAAAT GAT TAC TAAGGAAGAT T • 
ACT CAGTAAT T TGTT TAAAAAGTAATAAAAT T CAACAAACATT TGC TGAAT AGC TAC TAT AT GTC 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT T T T T GC TGAGACTAAT C T TATT CAT T T TC T C T AATAT GGCAACCAT TATAACC T TAAT T 
TAT TAT TAACATACC TAAGAAGTACAT TGT TACCTC TATATAC CAAAGCACAT T T TAAAAGT GC C 
AT TAACAAAT GTATCACTAGCCCTCC TT T T TC CAACAAGAAGGGAC TGAGAGAT GCAGAAATAT T 
TGTGACAAAAAATTAAAGCATT TAGAAAACTT 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGOTGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMI E TKWKEEKANDSNPNEE SKKTDKNPEESKS PSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 1.30-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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APP ID=10063736 



Page 160 of 322 



FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGT T GGAAATAAT ACCAT C CATGTGCAC C GAGAAAT TCACAAGATAACCAACAACCAGAC 
TGGACAAAT GGTCT TT TCAGAGACAGTTAT CACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
, ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
^ ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
Q GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
Cl ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
£fl ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
||| CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
Q CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
g 4 GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
ql CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
4M CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
1., TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
p CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Ip! CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
Q GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
[fl TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
f% TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
JS|j CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
^ f AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
C TCCACT AC CCCACAC CAGC C T TGGT GCCACCAAAAG T GC TCC C CAAAAGGAAGGAGAAT GGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT TGACACTGT CCC T C T T T GGCAGT T GCAT TAGTAAC T T T GAAAGGTATATGAC TGAGCGTAGCA 

TACAGGTTTVACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATATiATGAAATTTGC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAjyVTCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCGACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GATT GT AT AAGCATGC T T TC T T T GAG T T T T AAAT TAT GT AT AAACAT AAGT T GCAT T T AGAA 
ATCAAGCAT AAATCACTT CAAC TGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEW1ANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGT ICDNQRDCQPGLCCAFQRGLL FPVCTPLPYEGELCHDPASRLLDL I TWELE PDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
2 E -: GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
SJ ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
Ml GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
%?? GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
m AGAT TCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 



r~v. a 



"Star 

111 

f» •» 
>* 



CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAAC CTCCGTC TCTAT T AAAAATACAAAAAT TAGCC GAGAG TGGTGGCAT GCAC C T GT CAT CCCA 
GC TAC TCGGGAGGC T GAGGCAGGAGAATCGCT T GAACCCGGGAGGCAGAGGT TGCAGT GAGC CGA 

GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT T AAAG AT AT T T T G T T AAC T C 
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FIGURE io 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NOTLALGQCVLAGAFAS FYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
E YI DHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAYIMI AI YGKNFCVS AKNAFMLLMRN 
I VRWVLDKVTDLLL FFGKLL WGGVGVLS FFFFS GRI PGLGKDFKS PHLNY YWL P IMT S I LGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 27 6-288 

Ammonium transporters proteins . 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCC6GGAGCCACCGCC ATG GGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATGGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFL I LVGLTVGAFYI PDGS FTNI WFYFGWGS FLFI L IQLVLLI DFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVIASLHVmTLTNWYKPGETRKMISTWTAVWKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C T TGCC TGGAT TGAGACGTGGT TCC T GGAT T TCAAAG TGT TACC TCAAGAAGCAGAAGAAGAAAA 
CAGACT CCTGATAGT T CAGGATGC TTCAGAGAGGGCAGCAC T TATACC TGG T GG TC TT TCTGAT G 
G TCAGT T T TAT T CCCC TCCT GAATCCGAAGCAGGAT C T GAAGAAGC TGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACTATGAGTACTACTTTTGTTAAATGTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAA.TAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T T AATATAT TGAAGTAACAC T T T T TTAGTAAGCAAGATAC CT T T T TAT TTCAAT T C AC 
AGAATGGAAT TTTTTTGTTT CATGTC TCAGAT T TAT T T T GT AT TTCTTTTT T AACAC T C T AC AT T 

TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 

ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 

CTTGCAAAAGGGGAPLGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 

TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 

ACAAAATGACT TAAACCAT TCATATCATGTT TCCTT TGCGT TCAGCCAATTTCAAT TAAAATGAA 
CTAAATTAAAAA 
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FIGURE id. 

MNHLPEDMSNALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNWGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 73-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCT6CGCCGCCGC.CGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCGGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGC.TGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCGCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCC TGTACTGT GAGAGCCAGAT GGGGCAGGGGACAC GGC CCAGCCC TACAC CAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGAGCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGTG TGAC T CTAGTC T TGGC CCCAGGAAGC GAAGGAACAAAAGAAAC T GGAAAGGAAGAT GC 
T TT AGGAACAT GTTTTGCTTTTT TAAAATAT ATATAT T TATAAGAGAT CC T T TCCCAT T TAT T C T 
GGGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHSITNETFRGLRRLERLY 
LGKNRI RH I QPGAFDT LDRLLELKLQDNE LRAL P PLRLPRLLLLDLSHNS LLALE PG I LDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQE LDVSNLSLQALPGDL S GL FPRLRLLAAARNP FNCVC PLS WFGPWVRE S HVTLAS PEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LIAAIAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 
amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-143, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA 
AAGGACCATAC TAC TGCAGGCAGAGTAGT T GC TGGT CAAATAT T TC T T GAT T CAGAAGAAT C TGA 
ATTAGAATCCTCTATTCAAGAAGAGGAA.GACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCTTTCTAGAGTC TCCAAATCCAGAAAA.CAAGGAC TATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
C TACAACC TAT GAC TACAAAGCAGAT GAAAAGT GGGGC T T TT GT GAAACT GAAGAAGAGGC TGCT 
AAGAGACGGCAGATGC AGGAAGCAGAAAT GAT GTAT CAAACTGGAAT GAAAAT CC TTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCAT AT C GGTAT C T CCAAAASGCAGCAAGCAT GAACCATA 
CCAAAGC CCT GGAGAGAGT GT CATATGC T CTT T TAT T TGG T GAT TACT T GCCACAGAATAT CCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AAT AAT AT T AACAT CAGAAGAAT TTGTGGTT TATAGC GGC C AC AAC T T T T T C AG C T T T CAT GAT C 

CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACAT GGAATC TACAT GTAAAT GAAAGT T GG TGGAGT CCACAAT T T T TC TT TAAAATGAT TAG 

TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T G T C AG AAT CAT T T T T TACAT TAG AT TAT CAT AAT T T TAAAAAT T T T T C T T TAGT T T T T C A 

AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC 
C C T AAG AAT T G T T AAAAT T CAT G GAG T T AT T T G T GC AG AAT GAC T C C AG AG AGC T C TAC T T T C T G 

TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKIL'NGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALV Y YT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 2 67-271 

Microbodies C-terminal targeting signal. 

amino acids 2 99-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 



AATT CAGAT T T TAAGCCCAT TC T GCAGTGGAAT T TCAT GAACTAGCAAGAG GACACCATCT T CT T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAA ATG CTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTT TTATCACTGGATGTGAC T CGGGCT T TGGAAAC TTGGCAGCCAGAACTTT TGAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
C CC GGCG T GCT GGC T CCCAC T GAC T GGC T GACAC T AGAGGAC TACAGAGAACC TATT GAAGT GAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTGCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTAT TAAT GT C TCCAGT GT TGGAGGTCGCCT TGCAAT CGTT GGAGGGGGCTATACTCCATCCAAA 

TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
AT GCAT TGAACCAGGAT T G T T C AAAAC AAAC T T GGC AGAT CCAGTAAAGGT AAT TGAAAAAAAAC 
TCGCCATTTGGGAGCAGGTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
IP AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGAC CAC GC TC TAACAAGT C TC T TCC CTAAGAC T CAT TATGCCGCTGGAAAAGAT GCCAAAA 

I* ■ ■* 

T TT TCTGGATACCTCTGT CTCACATGCCAGCAGCTT TGCAAGACTT T T TATTGT TGAAACAGAAA 
m GCAGAGC T GGC TAATCCCAAGGCAG T GTGACT CAGC T AACCACAAAT GTC T CC TCCAGGCTATGA 

HI AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 



■? : *r. 



CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CAT C T C T TAT C T AAAT AT T AAAAG AT AAG T CAAC C C AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
RE P I E VNL FGL I S VT LNML PL VKKAQGRV I NVS S VGGRLAI VGGG YT P SK YAVE G FNDS LRRDMK 
AFGVHVS C I E PGL FKTNLADP VKVIEKKLAI WEQLS PDI KQQYGEGY IEKS LDKLKGNKS YVNMD 
LSPWECMDHALT SLFPKTHYAAGKDAKI FWI PLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain : 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



a .. « 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT TCAGAAATAT AT T CCAT GC TATCAGC T TT T TAGC TT T T ATAAT TC TT CAGGCGAAG TAAATG 
AGC AAG C AC T G AAG AAAAT AT TAT C AAAT G T C AAAAAG AAT G T GG TAG G T T G G T ACAAAT T C C G T 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
T TC AAACCAAGAC CTTGTTTTTCTGC TAT T AACACCAAG T AT AATAAC AGAAAGC TGCT CT AC T C 
ATCGACTGGAACAT TCCT TATATAAACCTCAAAAAGGACTT TT TCACAGGGTACC TT TAGTGGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T T T AGCC GAGC AGT ACAAAC ACAC AGC T CT AAAT T T T T T GAAGAAGAT GGAT CC T TAAAGG AGG 
T AC AT AAGAT AAATGAAAT GT AT GC T T CAT T AC AAGAGGAAT T AAAGAGT ATATGCAAAAAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC TCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTA7\AAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
P AGACAATCTGACCT TAATGGTAGAACACACTGACAT TCCT GAAGCTAGTCCAGCTAGTACACCAC 

Hi AAATCAT TAAGCATAAAGCCT TAGACT TAGAT GACAGATGGCAAT TCAAGAGAT C TCGGT T GT TA 

M GATACACAAGACAAAC GAT CTAAAGCAAAT AC TGGTAGTAG T AACCAAGATAAAGCAT CCAAAAT 

P« GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 
m CTACATT T TCAT CC T T TTAACC T TACAAGGAGAT T T T T T TAT T T GGC T GATGGG TAAAGC CAAAC 

ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T GT T T GC AGT AAT ACACAGATAAC T C T TAG T GC AT T T AC T T C AC AAAG TAC T T T T T CAAAC AT C A 

GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCAT GT C TAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T T TCAAAATAT GGAAA 
GAAATT TATATGAAAAT T TAT C T GAGT CAT T AAAAT TC TC CT TAAGT GAT ACT TT TT TAGAAGTA 

CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGE S T S AVLS G FVLGALAFQHLNTDS DTEG FLLGE VKGEAKNS I TDSQMDDVE WYT I D I QKY I 
PCY QL FS FYNS S GE VNE QALKKI L S NVKKNWGW YK FRRH S DQ IMT FRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGC C GC GC GGCGGAGGGCAGAGT CAGCCGAGCC GAGTC CAG CC GGACGAGCGGAC CAGCGCAGGGCAGC C CAA 

GCAGCGCGCAG.CGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 

GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 

GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 

CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 

TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 

GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 

AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 

AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 

CAAGAGAT GGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GAT T T CAAAAGT GT GGT CAGC GAACAGTGC AATCAT T TG 

CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AAT GATAT GTTT GT GAAGACATAT GGC CAT TTATACAT GCAAAATT CT GAGCTATTTAAAGAT CT CTTCGT AGAGTTG 

AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TT C CGCCT GGT GAACT CCCAGTACCACT TT ACAGAT GAGTAT CT GGAAT GT GT GAGCAAGT ATACGGAGCAGCT GAAG 

CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 

ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 

TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
GAGGGT CCTTT CAACATTGAAT CGGT CATGGAT CCC AT CGAT GTGAAGAT T T C TGATGCTATT AT GAACAT GCAGGAT 

AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CTGTTTGCAGTGACAGGAAAT GGATTAGCCAACCAGGGCAACAACCCAGAGGT CCAGGTT GACACCAGCAAACCAGAC 

ATACTGATCCTTCGTCIAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 
GACTT CTTTGAT ATCAGT GAT GAAAGTAGT GGAGAAG GAAGT GGAAGT GGCT GT GAGTAT CAGCAGTGCC CTT CAGAG 
TTT GACT ACAAT GCC ACT GACCATGCT GGGAAGAGT GC CAAT GAGAAAGC C GACAGT GCT GGT GT CC GTC CT G GGGC A 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGT GTT CAT CAAAAAGT TAAAAGGCACCAGT TAT CACT T TT CTACCATC CTAGT GACTT TGCTT TTTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCTGAAATAT TAAAT AGCT GTACAGAAGCAGGTT TTATTTAT CAT GT TAT CT TATTAAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTIAVLSAALIAML^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 

VTWPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMIJWAERLEGPFNIESVMD 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

KPDILILRQI3y[ALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KS ANEKADS AGVRPGAQAYLLT VFC I L FLVMQRE WR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCAT CCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCAT TATATTCT TCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 

TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC T T CT AGGAGAT GGC T CCAGGAAGGC GGCCAAGAATGTGAGTGCAAAGAT TGGT TCC T GAG 

AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAAGTGTAGCTTCCT 
AGC T AGT G T CAT T T AACC T TAAATGCAAT C AGGAAAGTAGCAAACAGAAG T C AAT AAAT AT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPmmSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 4 6-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATGG ACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACAT TGGGGAT GGCTGCTGCCTCACCAGAGAGACC TGGAAGGATCT TGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T T CC C C AG C C T C CAAT T AGAACAAG C C AC C C AC CAG C C T AT C TAT C T T C CAC T GAGAGG G AC C TA 

GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC TCAAAGAGGCAGAGG TT T T GT TC TCAAATAT T TTT TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 

MDILVPLLQLLVLLLTLPLHIMALLGCWQPLCKSYFPYLMAVLTPKSMRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDWVCTL VLCS VQS PRKVLQE VRRVLRPGGVL FFWEHVAE PYGS WAFMWQQVFE PTW 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQS FPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 

GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGT'CGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TAT GAT T C T T ATAC TAAT T T AT AC AAAGAT AT T AAGGCCCT GT T CAT T AAGAAAT T 

GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCGTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
t . CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
m AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
O TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
U\ ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTGGGGCATTCGGTACATGTGGTCGTACCATTT 
"ft AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
H J TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
rf! TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
3? GAT TCCGAGAAT CAT TGTCAT GTACAT GCAAAACGCACT GAAAGAACAGCAGCATGGT GCAT T G T 

W CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
%} AACCAGAATGCATATACTACAAC TGCTAT TAATGGGACAGATT TCTGTACATCAGCAAAAGATGC 

S ATTCAAAAT CT TGTCCAAGAACTCAAGTCACTT TACATCTATTAAC TGCT TTGGAGACT TCATAA 

TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAAC AAA.T GAT G GAT C G T C AG AAAAG C C C T AC T T TAT G GAT C AAG AAT T T C T GAG T T T C G T A 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACT CCAGGC CAT T G TGAG ATAGA TACC CAT T TAGGTATCTGTAC C T GGAAAACAT T 
TCCT TC TAAGAGCCAT TTACAGAATAGAAGATGAGACCAC TAGAGAAAAGT TAG T GAAT TT T T TT 
T TAAAAGACC TAATAAAC CC TAT TC T TCC T CAAAA 
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FIGURE 32 

MS GRDT I LGLC I LALALS 

LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDP PDHP I LS SLS I LFFYHQGTWKGS FL I SWRI PRI I VMYMQNALKE QQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI IFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites . 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT TT GT AT GGGAT T T GTGAT GCAGGAAAGCC TAAGGGAAAAAGAATAT TCAT TC T GTGT GGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT G T C AAAG C C GG AAAG AT CAT C GAT C C T GAG T T C A.T T G T G AAAT G T C C AG C AGG AT G C C AAG A 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
12 GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
W AGAGCTGAT CCAGGTATCCAAAGGCAAGAT CC T T CAGGAGC T GC C T TC CAGAAACC TGT TGGAGC 

(•##<■ « 

| :: | G GAT G T C AG C C T G G G AC T T G T T C C AAAAG AAGAAT T G AGC AC AC AG T C T T T G GAG C C AG TAT C C C 

fifs T GGGAGAT CCAAAC TGCAAAAT TGAC T TG T CGT T TT TAAT TGAT GGGAGCACCAGCAT T GGCAAA 

jUf CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAAT TCTCGAGATCTGAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACTT TCTAAT 
Iy GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
||| GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
2 GACT TGCGAGAGAGTCAGGAATCAACATTT TCT TCATCACCAT TGAAGGTGCTGCTGAAAATGAG 

*** AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
Jjf GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
111 ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
-Q GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
£| AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
SJ. TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
y TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
III GTCCAAGCCCAACAAGAGGAAGT TAAT GAT CCT CAT CACC GACGGGAGGT CC TAC GAC GACG T CC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT T T GACAACC T CCAT CAG TAT GTCC CCAGGATCAT C CAGAACAT TT GTACAGAGT TCAAC T CAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGC T TAAT GGGGCAC GCACGG T GCAT CAAGT C T TGGGCAGGGCATGGAGAAAC 
AAAT GT CT TGT TAT TAT TCT T TGCCAT CAT GC T T T T T CAT AT TCCAAAACT T GGAGT TACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCAT GTTGAGGGTGC T GGAGATTTTACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
TTTTGTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAA?\AA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RS GAPNVWVMVDGW P T DKVEE ASRLARE SGINIFFITIE GAAENE KQYWE PN FANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKS KPNKRKLMI L I T DGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELE VI ATHPARDHS FF 
VDE FDNLHQ YVPR I I QNI CTE FNS QPRN 

Important features: 
Signal peptide: 

amino acids 1-2 6 



N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 




Transmembrane domain: 



amino acids 181-200 
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FIGURE 

CCGAGGACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GT CACT GAGATC CC TCAAAT GGAGCC TCCT GCTGCT GT CACTCC TGAGT T TCT T TGTG ATGT GGT AC 
CTCAGC CTTC CC C ACT AC AAT GTGAT AG AACGCGT G AACT GGAT GT AC TTCT AT GAGT AT GAGCCG AT T T A 
CAGACAAGAC TT TC AC TTCAC ACT TCGAGAGCAT T C AAACTGCTCTC ATC AAAATCCATT TCT GGT CATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GGGGAT AT GAGG TT CTT ACATT T T TC TT AT T AGGCCAAGAGGCT GAAAAGGAAGAC AAAAT GTT GGC 
AT T GTC CT T AGAGG AT GAAC AC CT TCT T T ATGGT GACAT AAT CC GACAAG ATT T TT T AGACAC AT ATAAT A 
ACCT GACCT T GAAAACCATT AT GGC AT T C AGGT GGGTAACTG AGT TTT GCCCCAAT GCCAAGT AC GTAATG 
AAGAC AGAC ACTGAT GTT T TC ATC AAT ACT GGCAAT TT AGT G AAGTAT CT T TTAAACCT AAACC AC TC AG A 
GAAGTT T TT C AC AGGT T AT CCT CT AATT GAT AAT T ATT CCT ATAGAGG AT T T TACC AAAAAACCCAT ATTT 

CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
T GGC AGGTCATGCT AAGGAAC ACC AC AT GC C ATT AT TAACT T CAC AT T CT AC AAAAAGCC TAGAAGGACAG 
GATACC TT GT GGAAAG TGTT AAAT AAAG T AGGT ACT GT GGAAAATTCATGGGG AGGT C AGTGTGCT GGCTT 
AC AC TG AACT GAAACT C ATG AAAAAC C C AG AC T G GAGACTGG AGGGT T AC ACTT GT GAT T TAT T AG TC AGG 
CCCTTC AAAG AT GAT ATGT G GAGG AAT T AAAT AT AAAGGAAT TGGAGGTT T TTGCT AAAG AAAT TAAT AGG 
ACC AAAC AAT TT GGAC AT GT CATT CT GT AG ACTAGAAT TTCT TAAAAGGGT GT T ACTGAGTT AT AAGCTC A 

CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
T ATC T T AT GT GG ATT ACC AAT T T AAAAAT ATATGT AGT T CT GTGTCAAAAAACT TCT TCACT GAAGTT AT A 

CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATTATT TAAAAT T AC T TC AACTTT GT GTT T TT AAAT GT T TT G ACGAT T TC AAT ACAAGAT AAAAAGGATAG 
TGAATC AT T CTT TACATGCAAACATT T T CC AGT T ACTT AACT GAT CAGTT TAT TAT T G AT ACAT CACT CCA 

TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAGAAT TAAAGCAAG AAAAT CTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMIiA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP IKFEDVYVGI CLNLLKVNI H I PE DTNL FFLYRI HLDVCQLRRVI AAHGFS S KE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 



N-glycosylation sites. 



amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 



amino acids 239-243 



Ly-6 / u-PAR domain proteins. 




N-myristoylation site. 



amino acids 23-37 



amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC T TGGATGAC T TC TATGT GAAGGGGT TCTAC TGT GGAGAGTGCCGAGCAGGCT GGTACGGAGGAGAC TGC ATGCGATGTG 
GCC AGGT TC TGC GAGCC CCAAAGGGTC AGAT T T TGT TGGAAAGC T ATC C CCTAAATGCTCACTGTGAATGGACCAT TCATGC 
TAAACCTGGGTT TGT CATCCAAC TAAGAT TT GTCAT GT TGAGTC TGGAGT TTGAC T AC ATGTGCCAGTATGAC TAT GT TGAG 
G TTCGTGATGGAGACAACCGC GATGGC C AGAT C ATCAAGC GTGTC TGT GGCAACGAGCGGC CAGC TCC TATCCAGAGCAT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGCAGC GC TGTGAAAATC TCC T TGAAGAAAGAAACTGC TCAGACCC TGGGGGCC CAGTCAATGGGTAC C AGAAAA 
TAAC AGGGGGC CCTGGGCT T ATCAACGGACGC CATGC TAAAAT TGGCACCGTGGT GTCTTTCT T TTGTAAC AACT C C TATGT 
TCT TAGT GGCAAT GAGAAAAGAAC T TGCC AGCAGAATGGAGAGTGGT CAGGGAAACAGCCCAT C TGCATAAAAGC C T GCC GA 
GAACC AAAGATT TCAGACC T GGTGAGAAGGAGAGT T C TTC C GATGCAGGT TC AGTC AAGGGAGACACC ATTACAC CAGC TAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAACATC T GCATACC C AGC TCCAGT AT GAGT GC ATC TCACC C T TC TACCGC C GCC TGGGCAGCAGCAGGAGGACATGT 
C TGAGGAC TGGGAAGTGGAGTGGGCGGGC AC CATC C TGCATCC CTATC TGCGGGAAAAT TGAGAACATC AC TGCTCCAAAGA 
C CC AAGGGTTGCGCTGGCC GTGGC AGGCAGCCATC TAC AGGAGGACCAGCGGGGTGC ATGACGGCAGC C TACACAAGGGAGC 

GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC 
AGAGC C TACAGAT TTC TGC TATC AT TC TGCATCCC AACTAT GACCCC ATCC T GCT TGAT GC TGAC ATCGC CAT CC TGAAGCT 
C CTAGAC AAGGCCCGT ATC AGC AC CC GAGTCC AGC C C ATC T GC C TCGC TGCC AGT C GGGAT C TC AGCAC TT CC TTC C AGGAG 
TCCCACATCACTGTGGCTGGCTGGAATGTCC TGGC AGACGT GAGGAGCCCT GGCT T CAAGAACGACAC AC T GCGC TC TGGGG 
T GGTCAG TGTGGTGGAC TC GC TGC TGTG TGAGGAGCAGCATGAGGACC ATGGCATCC CAGTGAGTGTCAC TGATAAC ATGTT 
C TGTGCC AGCTGGGAACCC AC TGCCCC T TCT GATAT CTGCACTGC AGAGACAGGAGGCATC GC GGCTGTGT CC TTC CCGGGA 
C GAGCATC TCC TGAGCC AC GC TGGCATC TGATGGGACTGG T CAGC TGGAGCTATGAT AAAACATGCAGCCACAGGC TCTC CA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC T AGGACAGC CAAT TGGAAGATGCC AGGGC T T GC AAGAAGTAAGTT TC T TCAAAGAAGACCATATACAAAAC CT C TC CA 

CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCC TT TTGAGGC TC TCAAGT TC TAGAGAG C TGC C TGTGGGAC AGCCCAGGGCAGC AGAGCTGGGATGTGGT GC AT GC CTT 
T GTGTACAT GGC CAC AGTACAGTC TGGTC CT TT TC C T TCCCC AT C TC T TGTAC ACAT TT TAATAAAAT AAGGGTT GGCT TC T 
GAAC T AC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTV.INEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT I HAKPG FVI QLRFVMLS LE FD YMCQYDYVE VRDGDNRDGQ 1 1 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGE WS GKQP I C IKACREPK I S DLVRRRVLPMQVQSRE TPLHQL YSAAFS KQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT IQSLQI SAI I LHPNYDP ILLDADIAI LKLLDKARI STRVQPI CLAASR 
DL S T S FQE SHI T VAG WNVLADVRS PG FKND T LRS G WS WD S L LCEE QHEDHG I PVS VT DNMFC A 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide : 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites . 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids. 150-167 
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FIGURE 39 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAG AATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCT CC CACTACGGCCTGAC CAAAGAT AGGAAGAGGCGCT CACAAGAT G GCTGT C CAGAC GGCT GTGCGAGCC 
T CACAG CC AC GGCT C C CT CCCCAGAGGT TT CT GCAGCT GCCAC CAT CT CCTTAAT GAC AGACGAGCCT GGC CTAGACA 
ACC CT GCC TAC GT GT C CT CGGCAGAGGAC GGGCAGC CAGCAAT CAGCC CAGTGGACTCT GGC C GGAGCAAC CGAACTA 
GGGCACGGCC CTTT GAGAGAT CCACTAT TAGAAGCAGAT CATTTAAAAAAATAAAT CGAGCTT T GAGT GTT CT T C GAA 
GGACAAAGAG C GGGAGTG CAGTTGC CAAC CAT GC CGACCAGGGCAGGGAAAATT C T GAAAACACC ACT G C C CCT GAAG 
T CTT T CCAAG GTT GTACCACCTGATT CCAGAT GGT GAAAT TAC CAGCAT CAAGAT CAAT C GAGT AGAT C C CAGT GAAA 
GCCT CT CTATTAGGC T GGT GGGAGGTAGCGAAAC CC CACT GGT CCATAT CATT AT C CAACACATTTAT CGT GAT G GGG 
1*1 TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
U ACAACTACGCT GT GC GTCT C CTGC G GCAG C C CT GCCAGGT GCT GT G GCT GACT GTGAT GCGT GAACAGAAGTT C C GCA 

GCAGGAAC AAT GGACAGGCC CCGGAT GCCTACAGAC CCC GAGAT GACAGCTTT CAT GT GAT TCT C AACAAAAGTAGCC 
Mil . CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 

'<i. s 

GTGT GGCATAT CGACATG GT CAGCTT GAGGAGAAT GACC GT GT GT TAGCCAT CAAT GGACATGAT CTT C GATAT G GCA 

j! A 

1*1 GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

y 1 . GGAGCCCT GACAT CTT TCAGGAAGCC GGCT GGAACAGCAAT GG CAGCT GGT CC C C AGG GC CAGGGGAGAG GAGCAACA 

~ CT C CCAAGCC C CT CCAT CCTACAATT ACT T GT CAT GAGAAGGT GGTAAAT ATC CAAAAAGACCC C G GT GAATCT CT CG 

M GCAT GACC GT C GCAGGGG GAGCAT CACATAGAGAAT GGGAT TT GCCTAT CTAT GTCATCAGTGTT GAGC CC GGAGGAG 

Ul TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 

IkI GGAGTGAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGC 

sip i r *«i 

*M C CCAGGAAGACTGCAGC AG C CCAGCAGCCCT GGACT CCAAC CACAACAT GGCCCCACCCAGT GACT GGT CC CCATC CT 

W GGGT CAT GT GGCT GGAATTACCACGGTGCTT GT AT AACT GTAAAGATAT T GTAT T ACGAAGAAACACAGCT GGAAGTC 

I II T GGGCTT CTGCAT T GTAGGAGGT TAT GAAGAATACAAT GGAAACAAAC CTTTT T T CAT CAAAT CCATT GTT GAAGGAA 

C AC CAGCATACAAT GATGGAAGAATT AGAT GTGGT GATAT TCTTCTTGCTGT CAAT GGTAGAAGT ACAT CAGGAAT GA 
TACAT GCTTGCTT GG CAAGACTG C T GAAAGAACTTAAAGGAAGAATT AC T CTAACTATT GT TT CT T GGC CT GGC ACTT 
TTTT ATAGA ATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAAACACTATATTTATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACAC C T CAGAAAAT AT GATT CCAAAAAAATTAAAACTACT AGTTT T TTT T CAGT GT GGAGGAT T T CT CATTACT CT AC 
AACATT GTT T AT AT TT TTT CT ATT CAATAAAAAGCC CTAAAACAACTAAAATGATT GAT TT GTATACCC CACT GAATT 
CAAGCT GATT TAAATT TAAAATTT GGTATAT GCT GAAGT CT GC CAAGGGTACAT T ATGGC CAT T TT TAATT TACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 40 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVS AAAT I SLMTDE PGLDNPAYVS S AEDGQPAI S PVDSGRSNRTRARPFERSTI RS RS FKKINR 
ALS VLRRTKS G S AVANHADQGRENS ENTTAPEVFPRL YHL I PDGE ITS I KI MRVDP S ES LSI RL V 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLT^TM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-38.8 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
T AAAAAGT TTAATCGAGCCAAACTCTT GAATGT GGGCTATCTAGAAGCCCT CAAGGAAGAAAATT GGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAAT GGTC TT CCACACT AGAGACAAAGGCAATGAGGT GAACGCAGAACG 

GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
T AGTATCTGTGGAACACAAT C CT T T ATATATCAACATCACAGT GGAT TT CT GGTT T GGTGCATGACCC 

TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCT GGT GAT GTAGAGTATAAAACAGTTGTAACAAGAC AGCT T TCTTAGT CAT TT T GAT 
CATGAGGGTTAAATAT T GTAATATGGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGAT TTAT TTAAATTTGAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
C TT CAGGGGAGGACCTGCCCAGGTAT GCCTTC CAGT GAT GCCCACCAGAGAATACATTCTCTAT TAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATAT GT CTATCAAATACCTC T GTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EAS TKKVE LDNC PS VS PYLRGQS KL I FKPDLTLEE VQAENPKVS RGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHP FLQRQQLDYGI YVI HQAEGKKFNRAKLLNVGYLEALKEENWDC FI FHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 



APP ID=10063736 



Page 196 



FIGURE A3 

GC TCAAGAC CCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCAC TGAGC TCCCAGATC T GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAftAAAAAAAAAAAAAAAAAAAAJ^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Ad 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRA.GARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site* 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE A* 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC T TCAACACAACC CC TCTT GTCACCATACAGCCAGAAGGGGGCAC TAT CATAGT GACCCAAA 
ATCG TAATAGGGAGAGAGTAGAC T TC CCAGATGGAGGCTACT CCC T GAAGC T CAGCAAACT GAAG 
AAGAATGACTCAGGGATCTAGTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
G T T T CT GAAGAGAGAGAGAC AAGAAGAGT ACAT T GAAGAGAAGAAGAGAGT GGAC AT T TGT CGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT T TACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAAT CCCCAC TCACTGC TCACGAT GCCAGACACACCAAGGC TAT T T GC C TAT GAGAATG TTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE d6 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRE T PNI CPHS GENTE YDT I PHTNRT ILKE DPANT VYS T VE I PKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTC TGAGCCAGGGGTGACCATGA.CC TGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCT GGAGAGCATCTAGT TTCCACT TCGAT TC TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TGTAGT T T AAT GGGAATAAAAT GTAAGTAT CAGTAGT T T GAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLS S FFSVI TVI GALYCML IS IQALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features : 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



3 3 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACCT GCAGAGT CACACAGT TGC T GAC CAATTGAGCTG TGAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCT GGC CCT T C T C C TAC TGGCAGGCC T GAC TGCC T T GGAAGC CAAT GAC C 
CAT T TGCCAATAAAGACGATCCC T T C T ACTATGACTGGAAAAACC TGCAGC TGAGCGGAC T GAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
C AGCC AGAAGC AGC ACAGT CC T GTAC C T GAGAAGGC CAT C CCAC T C ATCAC T C CAGGC T C T GC C A 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVT LALLLLAGL TALE ANDP FANKDDP FYYDWKNLQLS GL I CGGLLAI AG I AAVL SGKCKYKS 
S QKQHS PVPEKAI P L I T PG S ATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



G TGGACT CT GAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC T GCAGACC CAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCGCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAAGCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 
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FIGURE sa 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCC CCCAAAAC GGAACT GGT T T TGGGG TCACC T GGGCC T GATCAC TC CTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
?*t CCT CAGAGGGCAGCAGT C GTCT GGACAT GT T T GAGCAC ATCAGC CT CAT GACC T TGGACAGT CTA 

is § CAGAAAT GCAT C T TCAGC T T TGACAGCCAT T GT CAGGAGAGGCCCAGTGAATATAT TGCCAC CAT 

Si CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 



Li 



3 5 i; 



ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTT GGAT TTCATTGATGTGCT TCTGCT GAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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. FIGURES A 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARIIJ^WTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDS LQKC I FS FDSHCQERPSE YIAT I LELSALVEKRS QHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAP FI SRCCTQD I VLPDGRVI PKGI TCL I D 1 1 GVHHNPTVWPDPEVYDPFRFDPENSKGRS P 
LAF I P FSAGPRNC I GQAFAMAEMKWLALMLLH FRFLPDHTE PRRKLEL IMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 1.3-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 



AT CGCAT CAAT T GGGAGTACCATCT T CC TCATGGGACCAG T GAAACAGC T GAAGCGAATG T T T GA 
GCCTACTCGTTTGATTGC7\ACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA TT CATGGCCAGT T T TATGAAGC T T T GGAAGGCACT AT GGACAGAAGCTGGTGGACAG T TT T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G T GAT T C G AAC AT T T GAG G G T T AC T T T T G G AAGC AAC AAT AC AT T C T C GAAC C T GAAT G T C AG T A 
H GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
If TCTGGAT GT T GTCCCACT GAAT TCC CATGAATACAAACC TAT TCAGCAACAGCAAAA?W\AAAAA 

m AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

•s. i: 



tj 

" iZ IT. 

J ; : 

■-■V'* T- 

S" 
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FIGURE *6 

MGPVKQLKRMFEPTRLIAT IMVLLC FALTLCSAFWWHNKGLAL I FC I LQSLAL TWYS LS FI PFAR 
DAVKKC FAVC LA 



Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein* 

amino acids 30-72 
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FIGURE 57 

CGGCTCGAGCTCGAGCCGAATCGGGTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 

TAC GT GCC G GT CAT C GGGGAAGCCCAGAC CGAGT T C CAGTACTT T GAGT C GAAGGGGCT C CCT GCCGAGCTGAAGTC C 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGAT AAGGAC CT T GAT GGGC AGCTAGACTTT GAAGAAT TT GT CC AT T ATCT C CAAGAT CAT GAGAAGAAG CT GAGG 
CTGGT GTTTAAGATTTT GGACAAAAAGAAT GATGGACGCATT GACGCGCAGGAGATCATGCAGT CCCTGCGGGACTTG 
GGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAGCATGGATAAAAACGGCACGATGACCATCGACTGG 

AAC GAGT GGAGAGACTACCACCT C CT C CACCC CGT GGAAAACAT CCCCGAGAT CAT CCT CTACT GGAAGC AT T C CAC G 
AT CT TT GAT GTGGGT GAGAAT CTAACGGT C CC GGAT GAGT TCACAGT GGAGGAGAGGCAGAC GGGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 

CAGGTCCAT GCCT CCCGCAGCAACAACAT GGGCAT CGT TGGTGGCTT CACTCAGATGAT T CGAGAAGGAGGGG C CAGG 
T CACT CT G GC GG G GCAAT GG CAT CAAC GTC CT CAAAATT GCC C CCGAAT CAG C CAT CAAATT CAT GGCCT AT GAGCAG 
AT CAAGCG CCTT GT T GGTAGTGACCAG GAGACT CT GAGGATT CACGAGAGG CT T GTGGCAGGGT CCTT GGCAGGGGC C 
* ; AT CG CCCAGAG CAGCAT CTACC CAATG GAGGT CCT GAAGACC CGGAT GGCG C T GC GGAAGACAGGCCAGT ACT CAGGA 

?** AT G C T GGACT G CGCCAGGAGGAT CCT GGCCAGAGAGGGGGT GGCCG CCTT CTACAAAG GCTAT GT CC CC AACAT G CT G 

Q GGCAT CATCC CCT AT GC CGGCAT CGAC CTT GCAGT CTACGAGACGCT C AAGAAT GCCT GGCT GCAG CACT AT GCAGT G 

S| AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 

CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
||1 TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
§ ,. H GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 

CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
X$. AAG CT GT CTC GAGCCAAGCT GT GAAAACCCTAGAC GCACCC G CAGG GAGGGT GGG GAGAGCT GGCAGGCCCAGGGCT T 

| , i GT C CT GCT GAC CC CAGCAGACC CT CCT GTT GGT T CCAG CGAAGACCACAG GCATT CCTTAGG GT C CAGGGT CAGCAGG 

til CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
pl TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
! :E , ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
U TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
m CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
Zl GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
|,| CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 

m ACT GTT GGGAAAAGGGTT TT GT C CAGAAGGACAAGC CG GACAAAT GAGCGACTTCT GT GCT T CCAGAGGAAGAC GAGG 

2; GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

CI G GGAC CAGCC CCACATT C CACTT GT GT CAC T GCTT GGAACCT ATT TATTTT GTAT TTAT TT GAACAGAGT T AT GTCCT 

fU AACT AT T TTTAT AGAT TT GTT TAATTAATAGCTT GT CATTTT CAAGTT CAT TTT TTATT CAT AT TTAT GT T CAT GGTT 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATT CCT TTTG GGACT GGAGGCAGAAAAGCGGCCAGAAGGCAGCAGC CCT GGCT CCT TTCCTTTGGC AG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 

AAC CT T GAAGGTGGAAT CCAGTTAT TTCCTGCGCT GCGAGGGTT T CTT TATTT CAC TCT TT T CT GAAT GT CAAGGCAG 
T GAGGT GCCT CTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAATCTTCAATAGGATGCAAAGATCAAT GCAAAAATTGTTATATATGAACATATAACTGGAGT CGT CAAAAAG 
C AAATTAAGAAAGAAT T G GACGT T AGAAGTT GT CATTTAAAGCAGC CTT CTAATAAAGT T GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGI IPYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALWTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYV 
VYENLKI TLGVQS R 

Important features : 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains : 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE fgQ 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
T T CCC T GGGGC AGAT CC T C T T C T GGAGCATAAT T AGCAT CAT CAT TAT T C T GGC T GGAGC AAT T G 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGAT GGAATCCTGAGCT GCAC T TT T GAACC TGACAT CAAACT T T CT GATAT 
G G TGAT ACAAT GGCTGAAGGAAGGT GTT T TAGGCT T GGT CCAT GAGT T CAAAGAAGGCAAAGAT G 
AGC T GT C GGAGCAGGAT GAAATGT TCAGAGGCCGGACAGCAGTGT T TGC T GATCAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
C AT C AC T T C T AAAG GCAAGG G GAAT GCT AAC C T T GAG T AT AAAAC T GGAG C C T T CAGC AT GC C GG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATAT GACC TAGT TT TATAT T TC T GGGAGGAAAT GAAT T CATAT C T AGAAGTCTGGAGT G 
AGCAAACAAGAGCAAGAAACAAZVAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
C TAT CT T CAAAGACATAT TAGAAGTTGG GAAAATAAT T CATGT GAAC T AGACAAGTG TGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T TCC CAAC TGACAAAT GCC AAAGT T GAGAAAAAT GAT CAT AAT T TTAGCAT AAAC AGAGC AG T 
C GGGGACAC C GAT T T T AT AAATAAAC T GAG CAC CTTCTTTT T AAAC AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAA 
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FIGURE 60 



MASLGQILFWSI ISI I I ILAGAIALI IGFGISGRHS ITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALL PL S P YLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



Transmembrane domain: 



amino acids 258-281 




N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 



T GAC GT CAGAAT CACCATGGC CAGC TAT CC T TACCGGCAGGGC T GCCCAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGAC T CAGATCACAGTGGC TAT AT C T C CAT GAAGGAGCTAAAGCAGGCCC T GGT CAACT GCA 
ATTGGTCTT CAT TCAAT GAT GAG AC C T G C C T CATGAT GAT AAACAT G T T T GACAAGAC C AAG T C A 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCT TC TCGGAT GC TATGACC CAACCATC T GT GGAGAGTGGAG T GCAC CAGGGACCT T 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCT GAGGCCACACAGATAGGGGCC T GATGGAGGAGAGGATAGAAGT T GA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCA7\ACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SG Y I SMKE LKQALVNCNWS S FNDETCLMMINMFDKTKS GRI DVYGFS ALWKFI QQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGG ATG CAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
T G C T CAT C T C C AAT AAAAT AAAAGC AC T TAT G AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
L~ Signal peptide: 

f*| amino acids 1-26 

■Wl Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

Z'il 

**!*■<.. 

IS S 

r« m V 
•T- N V 

s N-myristoylation sites. 

D amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

Ul 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 

w 
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FIGURE 6 s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGA CAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT $ 

Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 




N-myristoylation sites. 



amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAAT GAGAATAGCACT GT T T TGCC T TCAT CCAC CAGCTC CAGC TCCGAT GGCAAC CTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAGG TCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCT CTC T GGGCAGTCAGAT CCACCCAGTGCT TAATAGCAGGGAAGAAGGTAC T T CAA 
AGAC TCT GCCCCTGAGGT CAAGAGAGGAT GGGGCTAT T CACTT T TAT AT AT TTATATAAAAT TAG 
TAG TGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAMjLLAVGLALLVRKLREKRQTEGT YRPS SEEQFSHAAEARAPQDSKETVQGCLP I 

Important features : 
Signal peptide: 

amino acids 1-19 

r sl * Transmembrane domain: 

z;l amino acids 56-80 

hi N-glycosylation site. 

\1 amino acids 36-40 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

r*f amino acids 86-90 

Ml Tyrosine kinase phosphorylation site. 

it H g 

T:f amino acids 86-94 

.1 ,» - 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



G C C AGGAATAACT AGAGAGGAACAATGGGGTTATT CAGAGGT TTTGTTTT C CT CTTAGT TCTGTGCCTGCT GCAC C AG 
. T CAAATACTT C CTT CATTAAGCT GAATAATAAT GG CTT TGAAGATATT GT CATT GT TATAGAT CCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGATATGGT GACTACAGCTT CTACGTACCT GT TT GAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AAC CATAAACAT GCT GAT GTTATAGTT GCAC CACCTACAC T CC CAGGTAGAGAT GAACCATACAC CAAGCAGT T CACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TAC CGT GCTAAGT CAAAAAAAAT C GAAGCAACAAG GTGTT C CGCAG GT AT CTCT GGTAGAAATAGAGTTTATAAGT GT 
CAAGGAGGCAGCT GT CTTAGT AGAGCAT GCAGAATT GATT CTACAACAAAACT GTATG GAAAAGAT T GTCAATT CTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT C CATAAT GTT TAT GCAAAGT ATT GAT T CTGTT GTT GAATTTT GTAACGAA 
AAAAC C CATAAT CAAGAAGCT CCAAGC CT ACAAAAC AT AAAGT GCAAT TTTAGAAGTACAT GG GAGGT GAT TAGCAAT 
TCT GAGGATTT T AAAAACAC CATACCCAT GGT GACACCACCTC CT C CACCT GT CTT CT CAT TG CTGAAGAT CAGT CAA 
AGAAT T GT GT G CTTAGTT CT TGATAAGT CT GGAAG C AT GGGGGGTAAGGAC CGCCT AAAT C GAAT GAATCAAG CAGCA 
■ AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
f*% AAT AAGCTAAT CC AAATAAAAAGC AGTGAT GAAAGAAACACACT CATGGCAGGATTACC TACAT AT CCTCT GG GAGGA 

1*1 ■ ACTT CCAT CT G CT CT GGAAT T AAAT AT GCATTT CAGGT GATT GGAGAGCTACATT CCC AACTC GAT GGAT CC GAAGT A 

0 CT GCT GCT GACT GAT GGG GAGGATAACACT GCAAGT TCTT GT ATT GAT GAAGT GAAAC AAAGT GGGGC CATTGTT CAT 

ffl TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 

TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
m TCC CT T CAGCT CGAAAGTAAGGGATTAACACT GAATAGTAATGCCT GGAT GAACGACAC T GTCATAATT GATAGT AC A 

\l GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
s ' j ATAAT GGAAAATTTCACAGT GGAT GCAACTT CCAAAAT GGC CTAT CT CAGTATT C CAG GAACT GCAAAGGT GG GCACT 

W T GGGCAT ACAAT CTT CAAGC CAAAGCGAACC CAGAAACAT TAACT ATTACAGTAACTT CT CGAGCAGCAAATT CT T CT 

GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA 
AT T CTACAAGGATAT GT AC CT GTTCT TGGAG C CAAT GT GACT G CTTT CATT GAAT CAC AGAAT GGACATACAGAAGTT 
S TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
f R H ACAGAAAATGG CAGATATAGCTTAAAAGTT C GGGCT CAT G GAG GAGCAAACACT G CCAGGCTAAAATTACGGC CT CCA 

?|! CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT 
U I GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCAT CCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 

f 8 | CT T CC CT T GC CT GAC CAATACCCAC CAAGT CAAAT CACAGACCT T GAT GC C ACAGT T CAT GAG GATAAGATTATT CTT 

TI ACATGGACAGCAC CAGGAGATAATT TT GAT GTT GGAAAAGT T CAACGTTATAT CATAAGAATAAGT GCAAGTATT CT T 

Bl GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 

AGCTTT GCT^TTTAAACCAGAAAATAT CT CAGAAGAAAATGCAACCCACATATT TAT TGCCATT AAAAGTAT AGATAAA 
AGCAATTTGACATCAAAAGTATCCAACATTGCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
1 II CC TACACCTACTC CT ACT CCTACTCCTACT C CT GAT AAAAGT CATAATTCT G GAGT TAATATT T CT ACGCT GGTATT G 

TCT GT GATTGGGT CT GTTGTAATTGT T AACTTTATTT TAAGTAC CAC CATT TGAA C CTT AAC GAAGAAAAAAAT C T T C 
AAGTAGAC CT AGAAGAGAGTT TTAAAAAACAAAACAAT GTAAGTAAAG GAT AT TT CT GAAT CTTAAAATT CAT CC CAT 
GT GTGAT CAT AAACT CATAAAAATAATT TTAAGAT GT CGGAAAAGGATACTTT GATTAAAT AAAAAC ACT CAT GGAT A 
T GT AAAAACT GTCAAGATTAAAATTTAATAGTT TCATT TATT T GTTATTT TATTT GTAAGAAATAGT GAT GAACAAAG 
ATC CTTT TTCATACT GATAC CT GGTT GTATATT ATTT GAT G CAACAGTTT T CT GAAAT GATATTT CAAAT T GCAT CAA 
GAAATTAAAAT CATCTAT CT GAGTAGT CAAAAT ACAAGT AAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAZWWWIAAAAAAAAAAAAAAAAA 



5.5 s 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIYCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIWKLIQIKSSDERNTIMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLMSNAWMNDTVIIDSTVGKDTFFL 
!J I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 

. AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
Is | GADS FKNDGVYS RYFTAYTENGRYSLKVRAHGGANTARLKLRP PLNRAAY I PGWWNGE IEANP P 

SI RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
y FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 

3 it 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

O Signal peptide: 

Wjf amino acids 1-21 

* i* , 
•» *t 

^ » 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCTTAGGT GGAAACCCT GGGAGTAGAGTACT GACAGCAAAGACC GGGAAAGAC CATAC GTC CC C GGGCAGGG GTGA 
CAACAGGTGTCATCTTTTT GATCTCGTGTGTGGCTGCCTT C CTATTTCAAG GAAAGACGC CAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACT TGCCT GAGGGT GGAC CAGAAGAAAGGAAAGGT CC C CT CTT GC T GT T GGCT GCACAT CAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTaGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCTGCT C TGT GT GGTGGTTAACT C CAAGAGGCAGAACT C GTT CTAGAAGGAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CG C AACTACGT GAGCAGCCT GAAGC GGCAGATCGCACAGCT CAAGGAG GAGCT GCAGGAGAGGAGT GAG CAGCTCAGG 
is AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
lZ CT GGC CT T CCT GCACT C GCAG GTGGACAAGGCAGAGGT GAAT GCT GGC GT CAAGCT GGCCACAGAGT AT GCAGCAGTG 

U CCT TTC GATAGCTTTACT CTACAGAAGGTGTAC CAGCT GGAGACT GGC CTTAC CC G C C AC C C CGAG GAGAAGCCT GT G 

f! AGGAAG GACAAG C GGGAT GAGT T GGT GGAAGCCAT T GAAT CAGCCTT G GAGAC CCT GAACAAT CCT GCAGAGAACAGC 

21 CCCAAT CACCGTC CTT ACAC G GCCT CT GATT T CAT AGAAGG GATCT AC CGAACAGAAAGG GACAAAGGGACATTGTAT 

^ GAGCT CAC CT T CAAAGGGGACCACAAACAC GAATT CAAAC GGCTCAT CT TAT TTC GACCATT CAG C C CCAT C ATGAAA 

GT GAAAAATGAAAAGCT CAACAT GGCCAACAC GCT TAT CAAT GTTAT C GT GCCTCTAGCAAAAAG G GT GGACAAGTT C 
± C GG CAGT T CATGCAGAAT TT CAGGGAGAT GT GCAT T GAGCAGGAT GGGAGAGT CCAT CT CACT GTT GT TT ACTTT GG G 

_ — AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAACACT T CCAAAGCTGC CAACT T CAGGAACT TTAC CTTCATC 

k| CAGCT GAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

m TTTTT CT GTGAT GTGGACAT CTACTT CACAT CT GAATT CCT CAATACGTGTAG GCT GAATAC ACAG CCAG GGAAGAAG 

M 3 GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
~ CAG CAGCT GGT CATAAAGAAGGAAACT GGATT T TGGAGAGACT TT GGATT T GG GAT GACGT GTCAGT ATC GGT CAGAC 

fz\ T T CAT CAATATAGGT GG GTTT GATCT G GACAT CAAAGG CTGGG GCGGAGAGGATGT GCAC CT TTAT CGCAAGT AT CT C 

M CACAGCAACCT C ATAGTGGTAC GGACG CCT GT GCGAGGACT CTTC CAC CT CT G GCATGAGAAGCG CT GCAT GGACGAG 

||| CT GAC CC C CGAGCAGTACAAGAT GT GCATGCAGTC CAAGGC CATGAACGAGGC AT CCCAC GGCCAG CTGGGC AT GCT G 

,f?5 GT GTT CAGGCACGAGATAGAGG CTCAC C TT C GCAAACAGAAACAGAAGACAAGTAGCAAAAAAAC ATGAACT CCCAGA : 

M GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
111 GGACGACAAAAGAAT T GGACT GATGGGT CAGAGATGAGAAAGC CT C CGATT T CTCTCTGTT GGGCT TT TTACAACAGA 

pa AAT CAAAATCT C C GC TTT GC CTG CAAAAGTAACCCAGT TGCAC CCT GT GAAGT GT CTGACAAAGGCAGAAT GCTT GT G 

M AGATT ATAAGC CT AATGGTGT GGAGGTTTT GAT GGT GT TTACAATACACT GAGAC CTGTT GTT TT GT GT GCT CATT GA 

if|| AATATTCATGATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 

GCCTAT CAGCAGGGCTCTAGT TT CTAGGAAT G CTAAAATAT CAGAAG GCAGGAGAGGAGATAGGCT TATTAT GAT ACT 
AGT GAGTACATT AAGTAAAATAAAATGGACCAGAAAAGAAAAGAAAC CATAAATAT C GT GT CATATTT TC CC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGT CT GCTTATTTAATTACCACTTT GCAAGC C TTACAAGAGAG CACAAGTT GGCCTAC 
ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
C CAAGGACAT GC CAAAT GCTGATTCTGTCAGGCACT GAATGT CAGGCATT GAGACATAGGGAAGGAAT GGT TT GTAC T 
AAT ACAGACGT ACAGATACTTT CTCT GAAGAGTAT TT T C GAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACT TT CTGCT TTACAGAAAAG GAAACT CAT T CAGACT GGT GATAT C GT GAT GTAC CTAAAAGT CAGAAACCACAT TTT 
CT C CT CAGAAGTAGGGACCGCT T T CTTACCT GT TTAAATAAAC CAAAGTATAC CGT GT GAAC CAAAC AAT CT CTT TT C 
AAAAC AGGGT GCT CCT CCTGGCTT CTGGCTTC CAT AAGAAGAAAT GGAGAAAAAT ATATATAT ATATATATATAT TGT 
GAAAGAT CAAT CCAT CT GCCAGAAT CTAGT GGGAT GGAAGTT TTT GCTACAT GTTAT C CAC CCCAGGC CAGGT GGAAG 
TAACT GAAT TATTTTTTAAATTAAGCAGTT C TACT CAAT CAC CAAGAT GCTT CTGAAAATT GCATT TTAT TAC CATTT 
CAAACTATTTTTTAAAAATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTC^AGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGT GGTAT TGCACAG CTAATAAAATATGAT TTGT GGATAT GAA 
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FIGURE 72 

MMMVRRGLLAWI SRWVLLVLLCCAI SVLYMLACTPKGDEEQLALPRANS PTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRPYT AS DFI EGI YRTERDKGTLYELT FKGDHKHE FKRL I LFRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGE FSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

jU KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 

1*1 RKQKQKTS SKKT 

[,1 Important features: 

%| Signal peptide: 

|y . .. amino acids 1-27 

>-•; ? 

I a - . N-glycosylation sites. 

111 amino acids 315-319, 324-328 

:» ■ * ■ 

>I" EL * 

ftl N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Ami da ti on site. 

amino acids 377-381 
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FIGURE 73 



GAGACT GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T T GTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCT.GGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GC CCAGTCGCCCATCGATAT TCAGACAGACAGTGT GACATT T GACCC T GAT T T GCC 
T GC T C T GCAGC CCCAC GGAT AT GAC C AGC C T GGCAC C G AGC C T T T GGAC C T GC AC AACAAT G GC C 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T GAAGCCACAT T TGCAGAGCTCCACATTGTACAT TAT GAC TC TGAT TC C T AT GACAGC T TGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATAT AGCT T AT GAACACAT TC TGAG TCAC TT GCAT GAAGT CAGGCAT AAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T T C AAT GGAACAGC TGGAAAAGC T T C AG GGGACAT T GT T C T C C ACAG AAGAGGAGC C C T C T AA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGT^AATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCC TTAGGATAAAGAGTTGCTGTTGAAGT TGTATAT TTT TGATCAATATATT TGGAAATTAAAG 
TTTCTGACTTT . 
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FIGURE 74. 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTT CAAGGAGT TAAAGT TACT TACACTGT GCAGTATTTCATCACAA 
ATT GGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCAT TT CT GTTGTCC TGACAGC TCC 
AGAGAAGT GGAAGAGAAATCCAGAAGACCT TCCTGT TTCCATGCAACAAATATAC TCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
N ; AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
% ;:f CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
*p TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
I B S GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
%j ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGG T GAAACAT T TAGGGTAT GCT T CGCAT TT GATGGAAATT T T T TG TGAC TC TGAAG 
AAAACACG GAAGGT ACT T C TGT CAC CCAGCAAGAGT C CC T CAGCAGAACAAT ACCCCC GGATAAA 
ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 
GCTCAGTT TGCAGGAGGAGGTGT CCACACAAGGAACATTATTGGAGT CGCAGGCAGCGT TGGCAG 



TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 



CAGGAGCACACAGAC T CGGAGGAGGGGCCGGAGGAAGAGCCAT CGAC GACCC T GGTC GACTGGGA 

III 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAG GCCACCAGGAGAAAATGAAAC CTATCTCATGCAAT T CAT GGAGGAAT GGGGGT TATA 
T G TGCAGATGGAAAACTGATGCCAACACT TCCTTTTGCCTTTTGTTT CCTG TGCAAACAAG T GAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAG TCAAT AAAT G T T T GCCAGAC T GGGT GCAGAAT T TAT TCAGGT GGGT G T 



APP ID=10063736 



Page 229 of 322 



FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRA.QPSEKQCARTLK 
DQS S E FKAKI I FW YVL P I S I TVFL FSVMG YS I YRY I HVGKEKHPANL I L I YGNE FDKR FFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGGTGAC 
ACAGGAGG TGAAGGACCACAACGCCACCAGCAT CC T GCAGCAGCTGCCGCTGC T CAGT GCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGAGTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
il% TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
I z I CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
%J CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
Mtf TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
" ' AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 

:~ 

p| TTCAGGCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
|f| ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATGGGCTGAAGTCAAGCATCGGGC 
tl T GAT CAAT GAAAAGGCTGCAGAT AAGCT GGGATCT ACCCAGAT CGT GAAGATCC TAACTCAGGAC 

ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
Jjg TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 

TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 

GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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MAGPWTFTLLCGLLAA.TLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGS LVWTVLKHI I WLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT I VE 
FHMT TE AQAT I RMDT SAS GPTRLVLS DCATSHGS LR I QLL YKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVPI S LS I DRLE FDLLYPAIKGDT I QLYLGAKLLDS QGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNI SSDRIQLMNSGIGWFQPDVLKNI I TEI IHS ILLPNQNGKLRSGVPVSLVKALG 
FE AAE S S L T KDAL VL T PAS L WKP S S P VS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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GAGAGAAGTCAGCC TGGCAGAGAGAC TCTGAAAT GAGGGAT TAGAGGT GT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
C CAAGGGC CT C TGGAT GGAATGT GCCACACACAGCACAGGCAT CAC CCAGT GT GACAT C TATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGT CT GT GAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
G AC TAAT T T G T GC AT GAAC T GAAAT AAAAC CAT C C T AC G G TAT C C AGG G AACAGAAAGC AGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCC TGGGACAT TTAAAAAAATA 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPAD I QAAQAMMVTS SAI S S LAC 1 1 S WGMRCT VFCQESRAKDRVAVAGGVFFI LGGL 
LG FI PVAWNLHG I LRD FYS PLVPDSMKFE I GEAL YLG 1 1 S S L FS L I AG IILCFSCSS QRNRSNY Y 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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IS 2 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
■ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CO CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
Jf{ CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GT GAAGTACCT GGC CCAAGGCCACACAGC CAGAATC T TCCACT T GAC TCAGATCAAGAAAG TCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTC TT CC TAGCCCAGACCCT GGAGC T GAAGGT CAC GGCCAGT C CAGACAAAGT GACC AAG 
ACATAACAAAGACC TAACAGTTGCAGATATGAGCTGTATAATT GT TGT TATTATATATTAATAAA 
TAAGAAGT T GCAT TACCCT CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLS S LLGLALLWFPLDSHARARPDMFCL FHGKRYS PGE S WHP YLE PQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVL VHT S VS PS PDNLRR FALEHEAS DLVE I YLWKLVKDEE TEAQRGE VPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTT CT CT AG CGTGGT CC ACCT CT AC AGGGACGG GAAGGACC AGCC AT TT AT GC AG AT GCCACAGT AT C 
AAGGCAGG AC AAAACT GGT GAAGG AT T CTATT GCGGAGGGGCGC ATCT CT CTGAGG CTGG AAAACAT T AC T 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

is : 

I* AC AGGT GT C AGC ACTGGGCT CAGTTC C T CT CAT T T C CATC ACGGGATATGTTGATAGAGAC AT CC AGC T AC 

J! TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

&l AC AG AC TCCAGGACAAAC AG AGAC AT GC AT GGCCT GTTTGAT GT GG AG AT CT CT CT GACC GT CC AAGAGAA 

i: 5 a . 

CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
\'i GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 



*** fv"- 



TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAG AAAGCACGGAC AGGC AGAAT T GAG AGAC GC CC GGAAAC ACGC AGTGGAGGT GAC TC T GGATC C AG 
* AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
W_ CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
*M CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 

AGGAGT ACGTGAC T T T GT CT CCCGAT CAT G GGT ACT GG GTCC TC AG AC TGAATGGAG AAC AT T T GT AT T T C 
IZ AC AT TAAAT CCCCGTTTTAT C AGCGT CTT C CCCAGG AC CCCACC T ACAAAAAT AGGGGT CTT CCTGGAC T A 



8"W 



TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGG CT TAT T GAGGCC CT AC AT TGAGT AT CCGTCCT AT AATGAGCAAAAT GGAACT CCCATAGT CATC TGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
T GAGTC CTCC T C ACAGGC AACCACGC C CTT CCT CCC CAGGGGTGAAAT G TAGG ATGAATC AC AT CCC ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCTT ATT GAT GACAGAGT GTATCCT AAT GGT TTGT TCATT AT AT T AC AC TTT C AGT AAAAAAA 



APPJD-l 0063736 Page 237 of 322 
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MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSC FLS PKTNAEAMEVRFFRGQFS S WH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCT CTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC TGCTGACGAT GCAGAGT TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
N ; TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
W AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
g-j GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

y & 

\ . s. GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

\| ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

W GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

w ' GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCGTGCCAGGCTGAGCCTGAGCTGGA 

J B6 GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

iff CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

O GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

HI CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

!:! TCGGCAAGGCCAGGAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

i! K '* 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACT GCAGAGAC TCACCCT GAT T GAGGGATCACAGCCCCTC CAGGC AAGGGAGAAGTCA 
GAGGC TGATTC T TGTAGAAT TAACAGCC CT CAACGT GAT GAGC TATGAT AACAC TATGAAT TAT G 

TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANT DQDAP VATNNPARAVWEE TRDRFHLLGDPHTKNCTLS I RDARRS DAGR Y FFRMEKGS I KWNY 
KHHRLSVNVTALTHRPNIL I PGTLESGCPQNLTC SVPWACEQGTPPMI SWI GTS VS PLDPS TTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE. 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGT GCAGC TGAGAC TCAGACAAGAT TACAATGAAC CAAC T CAGCT TCCTGCTGTTTC 
T CATAGCGACCACCAGAGGATGGAGT ACAGAT GAGGC TAATACT TAC T T CAAGGAATGGACC TGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAAC TACAACACC T T TGGATC T GCAGAGGCGGC CACGAGCGAT GACT ACAAGAACCC TGGC TACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT T T GGCAT CTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGT GT T GGAC TGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAAT TCACTGCGGGAT TTGT TCAGTTCAGGGTAT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAAC T TACCCAGT AGC TAGAAT GT T AAT GG CAGAAGAGAAAACAATAAAT CATAT TGAC TCAAGA 
AAAAAA 
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MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
S DDYKNPGY YD IQAKDLGI WHVPNKS PMQHWRNS S LLRYRTDT G FLQTLGHNL FG I YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATG GGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATT^AAATATGTTTTGTAGTAACA 
T TAAGACT TATATACAGT T T TAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNS TQSL I S I GAHCS AS VALS FFI FERWECT TYWYI FVFCSALPAVTEMALFV 

TVFGLKKKPF 

i 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAG ATG CAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACC CTCAGCCAAGG TGAAGAAAAT C T T T GGC T GGGGCGAC TT C TACT CCAACATCAAGAC GGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAA7\ATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TC TACATC GCCT TC TACAGCACGGAC TATCGGC TGGT CCAGAAG GT GT GCCCAGAT TACAAC TAC 
CAT AG T GAT AC C CC C T AC T AC C CAT C T GG G TGAC CC GGG GC AG GC C AC AGAG GC C AGGC C AGG G C 
T G G AAG G ACAGG C C T GCC CAT G CAG GAG AC CAT C T GGAC AC C GGG C AG G GAAG GGG T TGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGC TGGTCCCAACCTGAAGCTGTGGAGTGACTAGAT CACAGGAGCACTGG 
AGGAGGAGTGGG.CTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCT T GGT T C TT GCCATCC T GAG GAAAGAT AG CAAC AGG GAG GGG GAG AT TT CAT CAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT CC T TAAGC TAAGACAGGACGAT T GTGGTCC TCCCACAC TAAGGC C ACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMZ\NSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S I SLVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
AT GT T C CGAT T T GC AT AT TATAAACT CT T AAAAAAAGCCAG T G AAGGT T T GAAGAGTATAAACC C 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGGCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACC T CAGGGAACCAGCACT TCCCAAACCGCAGACT ACAT C TT TAGAGGAAGCACAAC T GT GCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALWFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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Page 



FIGURE 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGT AGC AAT C TGAAACATGAGGAGT ACGAT T C T AC TGT T TTGTCTTC TAGGAT CAAC 

TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 

GAACACTACCAAACCAACAGCAG TCAAAT CAGGTC T TTCCTTCTT TAAGT CTGATACCATTAACA 

CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 

CCAGACCGACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 

CAAT T T T T GTCACACAAC T TGGAGCCCAGGGCAC TATCC TAAGC TCAGAGGAAT TGCCACAAAT C 

TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 

T AAT CCAGAT GT CCAGGATGGAAGCCT TCCAGCAGGAGGAGCAGGT GTAAATCC T GCCACCCAGG 

GAACCCCAGCAGGCCGCCTCCCAACTGCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 

GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 

G TAA GCTGTTTCAAATTTTTTC7^ACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 

AT T GAT TATATTATGGAATAGAT TGAGACACAT T GGATAGTCT TAGAAGAAAT TAAT T C T TAAT T 

T AC C T G AAAAT AT T C T T G AAAT T T C AG AAAAT AT G T T C TAT G TAG AG AAT C C CAAC T T T T AAAAA 

CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 

T AAAACAT AT T T GGAAAAC T G G AAAAAAAAAAAAAAAZ\AAAAAAAAAAAAAAA?Uy\AAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLP I FVTQLGAQGT ILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DDD FAVT T PAG I QRS THAI EE AT TE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCC T TGACCACGCC CTGGAGAGGACC TAC CAGC TGT TGGTACAGGT CAAGGACAT GGGTGA 
CCAGGCC T CAGGCCACCAGGCCAC T GCCAC CGT GGAAGTC T CCATCAT AGAGAGCACC T GGGTG T 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 

CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACCTC TAC GT GACCAGAGAGC T GGACAGAGAAGCCCAGGCT GAGTACC TGC TCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGT CCACCAGGTAC TGAAG TGACTAGACT GT CAGCAGAGGATGCAGATGC C CCCGGC T CCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCC TGGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAAT.AATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVXWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IE S TWVS LE P IHLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQS VAKLVGPGPGPGATATVTVLVERVMP PPKLDQES YEASVP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAAGGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCGTTTCTCAAGAATCCTCTGTTCTTTGCCGTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCT TGCT TT CCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
TTCTCCT TATGTTTGGTCTACTATT GCATT TAGAAGCTGCAACAAAT TCCAATGAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGAGCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAAC TCT GAGT CCAGCACACC C TCCAGT GGGGCCAGCACAGT CACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGT AGGGCCAGCAC T GCCACCAACT C TGAGTC TAGCACAC TCT C CAGT GGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCAC7^ACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTGCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTC T GACTCCAGCAGAACCT C CAGT GAGGCCAGCACAGCCACCAACT CT GAGT C TAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACT C TGGGTCCAGT GT GACCT CTGCAGGC TC T GGAACAGCAGC TC TGAC T GGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
T^ATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
T GCT CAT T TAGC TAAGAAATAAAT ACATCT CAT CTAACACACACGACAAAGAGAAGC TGTGC T T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C T GG CAT T C AAAAT C T C C AC AG T AAAAT C C AAAG AC C T C AAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 

MKMQKGWLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTS SEAS TATNSESSTVS SGI STVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 

AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPI^WFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATCCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGC AG T T T C T AGC TACT GCAC GT AC T T CATAAAGC AGGAC T C TAAAAGC T T T GGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
W GAT TATAACT GGGGATGCAT TTGTTCCT GGAGAAAGAAGT GTCAT TAT CAT GAACCAT CGGACAA 

HI GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
Ul TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 



s i; " 



s 



C TT CAT T C AT AGGAAATGGAAGGAT GACAAGAGCCAT TT CGAAGACATGAT T GAT TACT T T T GT G 
ATAT T CACGAACCACTTCAACTCCTCATAT TCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
T CT CGAAGTAATGCAT TTGCTGAAAAAAAT GGACTTCAGAAATATGAATAT GTT T TACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTC71ATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGA7\ATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
G GGAGAAGAAT T TT TAT T T TACCGGACAGAGTG TCAT TCCACC T T GCAAGT C T GAAC T CAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAAT AT T T GG T GGAC TGGAGAT CATAGAAC T T GCAT GT TACCGAC T T T T ACAC AAAC AGCCACAT 
T TAAATTCAAAGAAAAATGAGTAAGATTATAAGGTT TGCCATGT GAAAACC TAGAGCATATT TTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCC AT CAT TAT T TGT TAAAGATAT T T TGCACT TAAT T T TGTGGGAAAAATAT T GC TACAAT T TT T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
G GGCC AGAATAT TAT TAAACAAT CAT CAGGCT T T TAAA 
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FIGURE 102 



MHSRGRE I WLLNPWS INEAVSS YCTYFIKQDSKS FGIMVSWKGI Y FI LTLFWGS FFGS I FMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

y Important features of the protein: 

SJ Signal peptide: 

amino acids 1-22 

\l 

Ul Transmembrane domains : 

amino acids 44-63, 90-108, 354-377 

K 

*f* "» W - 
if *r 

:: 5 

■|asr| 

fin 

6 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CATCCATC CACCCCT GCT GT CAT CT GT TT TCATAGT G T GAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
y ? AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
lZ ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
y GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
13 TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
fp AAAT C CAAAGGGAAAATC CAGGCGGAACT GGAC T GGAGAAGAAAGCACGGACAGGCAGAAT T GAG 

| g | AGACGCCCGGAAACACGCAGTGGAGGT GACTCTGGAT CCAGAGACGGCT CACCCGAAGC TC T GCG 

Tl T TTC T GAT CT GAAAAC TGTAACCCAT AGAAAAGCTCCCCAGGAGGTGCCTCACT C TGAGAAGAGA 

?! T TTACAAGGAAGAGTGTGGTGGC T TC TCAGGGT T TC CAAGCAGGGAGACAT TACT GGGAGG T GGA 

W CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
f|l ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
pi GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
;^ TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
W| G GGAC T C C CATAT T CATAT G TC C AG T G T C C T GGGGATGAGACAGAGAAG AC CC T GC T TAAAGGGC 

Q C CCACACCACAGAC CCAGACACAGCCAAGGGAGAGT GC T CC CGACAGG TGGCCCCAGC TTCC TC T 

g| CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
JS-I T GAG C C C T G C AG CAGC GGC AG T C AC AG C T T CC AGAT GAGG G GG GAT T G GC C T GAC C C T G T GGGAG 

y TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 

hi CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGT GAT TCCTGCC T CACAGGTGAAGAT TAAAGAGACAACGAAT GT GAAT CAT GCT T GCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
T GC T TAT T CC AC AT TAAT T T AC T T T T C T C T AT ACCAAATCACCCAT GGAAT AG T TAT T GAACAC C 
T GCTT TGTGAGGGTCAAAGAATAAAGAGGAGGTAGGATT TT TCACTGAT TCTATAAGCCCAGCAT 
T ACC T GAT ACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC TACAGG T CCATATCC 
C T CAT T AAC AC AGAC AC AAAAAT T C T AAAT AAAAT T T T AAC AAAT T AAAC T AAAC AAT AT AT T T A 
AAGAT GATATATAACTAC TCAGTGTGGTTT GTCCCACAAATGCAGAGTTGGTTTAATAT TTAAAT 
ATCAACCAGT GTAAT T CAGCACAT TAAT AAAG TAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS IHLAEQSHEVESKVLIGE T FFQPS PWRLAS ILLGLLCGALCGWMGMI IVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QGFQAGRHYWE VDVGQNVGWYVGVCRDDVDRGKNNVTLS PNNGYWVLRLT TEHLYFT 

FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide : 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150,, 235-259 
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FIGURE 105 

CC TTCAC AGG AC T C T T CATT GCTGGT TGGC A ATGA T GT ATCGGC CAGATGTGGT GAGGGCT AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT TC AT T AT GTGAG AT AT AATC AAAAGAAG ACCT AC AATT ACT ATAGC AC AT T GTC AT TT AC AAC 
T G AC AAACT AT ATGCT GAGT TTGGCAGAG AGGCTT C TAAC AATT TT AC AG AAAT GAGCCAGAGACT TGAAT 
C AAT GGTGAAAAAT GC AT TT T AT AAATCT C CAT T AAGGGAAG AAT T TGTC AAGT CT C AGGTT AT CAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTC7VACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCT C AC T C AGTT AAAAT T AAAAAAAT CAAC AAGACAGAAACAGAC AGC T ATCT AAAC CAT TGCT GCGGA 
AC ACGAAGAAGT AAAACT C T AGGT CAGAGT CT C AGGAT CGTT GGTGGG AC AG AAGT AG AAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
ATAAAACCT TCGAAAATG AAACGGGGT C T C CGGAGAAT AATT GT CC AT GAAAAATACAAAC ACC CATC ACA 
TGACT ATGAT AT TTCT CT T GCAGAGC T TT C TAGC CC TGT TCCCT AC AC AAAT GC AGT AC ATAGAGT T T GT C 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGT T AC AGT C AAAAT C ATCTT CG AC AAGC AC AGGT GAC T CT C AT AGACGCT AC AACT TG CAAT GAACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 

AGCT GGGGAGAT GAAT GT GCGAAACC CAAC AAGC CT GGTGT T TATACT AG AGTT AC GGCC TT GCGGGACTG 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GT CAT C TGTG AGCAAT AGTT GAAACT TT AT GT AC AT AG AG AAAT AG AT AAT AC AAT AT TAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TC AAGGTGCAGAAC AAGGAGTG AAAGAAAAT AT AAG AAGAAAAAAATCCC CT AC AT T TT ATT GGCACAGAA 
AAGT AT T AGGTGTT T T TCTT AGTGGAAT AT T AGAAATGATC AT ATTC ATT AT GAAAGGTC AAGC AAAGACA 
GC AGAAT ACC AATC AC TT CATC AT T T AGGAAGT ATG GGAACT AAGTT AAGGAAGTCC AGAAAGAAGCC AAG 
AT AT AT CCT TAT TT T CAT TT CC AAAC AACT ACT ATGAT AAAT GT GAAG AAGATT CT GTT T TT T TGT GACCT 
AT AAT AAT TAT AC AAACT T CAT GCAATGT ACT T GTT CT AAGC AAATTAAAGC AAAT ATTT AT T T AACATT G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 



.V, f. 



MMYRPDWRARKRVCWE PWVIGLVI FISL IVLAVC I GLT VHYVRYNQKKT YNYYS TLS FT TDKLY 
AE FGRE ASNNFTEMSQRLE SMVKNAFYKSPLREE FVKS QVIKFS QQKHGVLAHMLL I CRFHS TE D 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAGS LEGKTDACQGDSGGPLVS SDARDI WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 



lit 



' 1 
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FIGURE 1Q7 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTAGTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
T GCT TGGAAT T TT TT TCAGAGAGAGT T TAAGTGC T GT GGAGTAGT AT ATT T CAC T GAC TGGT TGG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATC TCAGTGACCT T TAT CAAGAGGGT T G T GGGAAGAAAATGTATT CC TT 
T T T GAGAGGAACCAAACAAC T GCAGGT GCT GAGGTT T C T GGGAAT C T CCAT T GGGGT GACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCC TTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
T T GAGAT GGAGGAGT T ATAAA AAGAAAT GT CACAGAAGAAAACCACAAACT T GT T TTAT TGGACT 
TGT GAAT T T TT GAGTACATACTATGT GT T T CAGAAATATG TAGAAATAAAAATGT TGC CAT AAAA 
TAACACCTAAGCATATAC TATT CTAT GCTT TAAAAT GAGGATGGAAAAGT T T CATGTCATAAGTC 
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
TCCAT GCAAACGAG TCACATAT GGT GGGAC TGGAGCCATAGTAAAGGT TGAT T TAC T T CTAC CAA 
C T AG T AT AT AAAGT AC TAAT TAAAT GCT AACAT AGGAAGT TAG AAAAT ACT AAT AAC T T T TAT T A 
CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
ACCT AAAT GT GAT T T T TGCTGG TTAC TAAAAT AT TC T TAC CAC T TAAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T T C AG T T C T GAT AAT G TT AAGAAT AACC AT TAT GAAAAGGAAAAT T T G T C C T GT AT AGCAT CAT T 
AT T T T TAGC C TTTCCTGT TAAT AAAGC T T TAC T AT TC T GT CC T GGGC T TAT AT T ACAC ATAT AAC 
TGTTATTTAAATACTTAACCACTAATTTTGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
AGAAT GTAGT C T GGT C TT TAGGAAGT AT TAATAAGAAAAT T TGCACATAAC T TAG TTGAT T CAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAG TGTTCTTTT TCTCCAGAAAAAT GC T TG TGAGAATCAT TAAAACAT GT GACAATT TAGAGAT T 
CTTTGTTT TAT T T CAC T GAT TAAT AT AC TGT GG C AAAT TACAC AGAT TAT TAAAT T T T T T T ACAA 
GAGTATAGTATATTTATTTGA7VATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TC T CAGAAT ATGGAAAGAAAAT TAAAAT G T GT CAATAAATAT T T T C T AGAGAGTAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLMS I SVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPWHP 
VMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGWTYEQELMVPVQWSDMVT 
LKAEIMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 

EDLS DL YQEGCGKKMYS FLRGTKQLQVLRFLGI S I GVT Q I LAMI LT I T LLWAL YYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE log 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATGC CTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
T GGT ACCACTTCAC GGGCAT GGCGGGAGATGCCAT GCCT ACCTTCTGC AT ACC AGAAAAC GACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCT GCTC AGAT ACCAGC GAGTGCAC ATGC GCTCCAGGAACT GT GCTAGGC CCT G AC AG GC AG AC AT GCTTTG 
AT G AAAAT GAAT GT G AGC AAAAC AAC G GT GG C T GC AGT GAGAT C T GT GT GAACC T C AAAAACT C C T AC C GCT GT 
GAGTGTGGGGTTGGCCGTGTGCTAAGAAGTGATGGCAAGACTTGTGAAGACGTTGAAGGATGCCACAATAACAA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGT GTTGTGCAAATCAAAT GCCAT TGAAGT GAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCT C AAGAC AT GT GGT AC AGT G GT C GAT GT GGT GAAT G ACAAGAT T GT GGC C AGC AACC T C GT GA 



* ii 

T* " 1> 



•i E :; 
■a? - 



CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T GC G AGT T T CCAC GCC T GT AC ACC ATT T C T GAAGGAT AC GT TC C C AACC T T C G AAAC T CCC C ACT GGAAAT 
![! CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
t::f GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 

,f »< 

M !\ 

III. TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
Ifs CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 

.<; s s 

TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGAC TAG TTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACT GT GAGT T C AGAC T CCC AGC ACC AAC TC ACT C T GATTCT GGT CCATTCAGT GGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT C AG AAGCT GGGT AT AAT AT T TC AAGT T AC AAAC CC T AGAAAAAT T AAAC AGT TAC T GAAAT T AT GA 

CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAAT TT GGAAGT GT AT C AAT AAAAC AGT AT AT AAT T T T 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVWDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLBQTYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



Important features of the protein: 
Signal peptide: 

amino acids 1-16 



N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 : 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE 111 

GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CC TGCT GCCTTGGGGTGACAAT CT C AGCT C CAGGCT AC AGGGAG ACCGGG AGGAT C AC AGAGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 

AT GG AGAC CTTC AGAAAGGT GGGG AT C CC C AT C AT C AT AGCACT ACTG AGCC T G GCGAGT ATC ATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCT GT GT GACGGAGAGCTGGACT GT CCCT TG GGGG AGGACGAGGAGC AC TGT GT C AAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GC T ACAGCAGAGCT GT GG AG AT TGGC CC AG ACCAGGAT CTGG AT GT T GTT GAAATC AC AGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGC AT C CAGT ACGAC AAACAGC AC GTCT GT GGAG GGAGCATC CTGG AC CCCC ACTGGGTCCT C ACGGC A 
GCCC ACT GCT TC AGGAAAC AT ACCGAT GT G TT C AAC TGGAAGGT GCGGGC AGGC TC AGAC AAACT G GGCAG 
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGC TC ACTC CAGC C ACCCC ACT C TGGAT C AT T GGATGGGGCTT T ACG AAGCAG AATGGAGGGAA 
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCC GAGC AC C CCAGG AGTAT ACACC AAGGTC T C AGCCT ATCT CAACT GG AT CT AC AATGT CT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG AT C CCC C AAAGT C AG ACAC AGAGC AAGAGT C C CCT T GGGT ACACCC CTCT GCCC AC AGCCT C AG CAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCC CT AGCT CGGC CAC AC T TGGT GC TC CC AGC ATCCC AGGGAG AGAC AC AGCC C ACT GAACAAGG TCT 
C AGGGGT AT T GC TAAGCC AAGAAGGAACTT T C CC AC ACT ACT GAAT GGAAGC AGGC TGT C TT GTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AGAGC AAGAAACCAGTT GT AAT ATAAAAT GC AC TGCC CT ACTGTTGGT AT GACT ACCGTT ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCA^AAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDWPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KT PRWGGEEAS VDS WPWQVS I QYDKQHVCGGS I LDPHWVLTAAHC FRKHT DVFNWKVRAGS DKL 
GSFPSIAVAKI II IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 112 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTTyVGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
I*' CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
Izl AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
jfj GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACGTTTTCATGTCCTGCACATCACCTG 
[ g I AT C CAT GGGCTAATC TGAAC T C TGT C CCAAGGAACCCAGAGC T T GAG T GAGCTGTGGC TCAGACC 

SJ CAGAAGGGGTC TGCT TAGACCACCTGGT TTATGTGACAGGACT TGCAT TCTCCTGGAACATGAGG 

IU. GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
~ M GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
l* % CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
m CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
O TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
Kl CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 

GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGC TGAGACAGGAGAAT TACT TGAACC TGGGAGGT GAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGA7VAAAATAAAAAAAGAATTA 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE us 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAAT CC T CCAGAGAAT TG TG AAGAC T GT C AC AT T C T AAATGCAGAAGC T T T TAAAT CC AAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T T TGG GGGAGCAAGCAC TT C TGGCCGGAGGTACCCAAAAAAGCCT AT GACATGGAGC ACAC TT 
T C T AC AG C AAT GGAG AGAAGAAG AAG AT T T AC AT GGAAAT T GAT C C T G T G AC C AG AAC T GAAAT A 
T T CAGAAGCGGAAAT GGCAC T GAT GAAACAT TGGAAG TGCACGAC T T T AAAAAC GGATACAC TGG 
CATCTACTTCGTGGGTCTTCAAA7VATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T GAACC AGAAGAGGAAAT AG AT GAGAAT GAAGAAAT TACC AC AAC T T T C T T T GAAC AGTCAGT G 
ATT T GGGT CCCAGCAGAAAAGCC TAT TGAAAACCGAGAT T T TC T TAAAAAT TCCAAAAT TC TGGA 
GAT T T GT GATAACGT GACCAT GT AT T GGAT CAATCCCAC TC TAAT AT CAGT T TCTGAGT TACAAG 
ACT T T GAGGAGGAGGGAGAAGAT CT T CACT TTCC T GCCAACGAAAAAAAAGG GAT TGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC TTCCAATAAATGACTATACTGAAAATGGAATAGAAT TT GATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TATC ACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV. 

Important features of* the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

GlycosaminoglYcan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATC ATG TCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGT TCC AGT ACG AAGG G CT CT GG AG G AGCTGCGTGAGGC AGAGTT CAGGCT TCACCG AAT GC AGG CCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCCAACATGAC ACTGACCT CCGGGATCAT GTTCAT TGTCTC AGGT CTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
l H GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
ill TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
ffi GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
f =i j TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
SI CTT CCAAGC ACGACT ATGT GTAATGC TCT AAGAC CT CT C AGCACGGGC GG AAG AAACT CC CGG AGAGC T C A 



:! S3 
iJ 2 ! 



CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
TTCT AC TCTGAT GAGAGAAT GT GGT T T T AATCTC TCTCT C ACAT TT T GAT GAT T T AGACAGACTCCCCCTC 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 



GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

Hi 

m ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
ffj. . ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCT AGATGAAT GAG AAAATT ATTT TT TT T AAT TT AAGT CCT AAAT AT AGT T AA 
AATAAAT AAT GT T T T AGT AAAATGAT AC AC T AT CT C TGT G AAAT AGCC TC ACCC CT AC AT GT GG AT AGAAG 
GAAATG AAAAAAT AAT TGCT T T GACATTGT C TAT AT GGT ACT TT GT AAAG TCAT GCT TAAGT AC AAAT TCC 
ATGAAAAGCT C ACAC C TGT AATCCT AGC AC T T TGGGAGGCTG AGGAGG AAGGAT CACT TG AGCCCAGAAGT 
T C GAGACT AGCCT GGG C AAC AT GGAG AAGC CCT GT C TCT ACAAAAT AC AG AGAGAAAAAATCAGCC AGTC A 
TGGT GGCAT AC AC CTGT AGT C C CAGC ATT C CGGGAGGCT G AG GT GGGAGG AT CACT T G AGCCC AGGGAGGT 
TGGGGC TGCAGT GAGC C ATG AT CAC ACC AC TGCACT CC AGCCAGGT GACATAGC GAGATCCT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT I LGLPAMLQAVRALMIVGIVLGAI GLLVS I FALKC IRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVSVFANMLVTN FWMS TANMYTGMGGMVQT VQ TRY T FGAAL FVGWVAGGL TL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE no 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGT.GGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
1^ CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
Q ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
El CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
J' I CAGATACTCGAT ACCT T CCCATCGC ACAACCCAAAAAAGTTAT CACACCGGAAAGAAGT CACCGAGCG 

%J ,. TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
3 J* ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 

V-.I' 

m AATCTTAATTACAGGAACTGTGCATCAGCTAT TTAT GATTCTATAAGCTATTTCAGCAGAATGAGATA 

. ~P •' 

s TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

O CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

^ ACG AC AT AG CAT TAT GT AC AT AG AT G AGT GT AACAT T TAT AT C T C AC AT AG AG AC AT G C T TAT AT GG T 

: K f TTTATTT AAAATGAAAT GCCAGTCCAT TACACT GAATAAAT AGAACTCAACTATT GCTTTTCAGGGAA 

Shi: 

!«J ATCATGGATAGGGTT GAAGAAGGT T ACTATTAATTGTT TAAAAACAGC TTAGGGAT TAAT GTCCT CCA 

S "if 

f «! 

T TTATAATGAAGATT AAAAT GAAGGCT TTAAT CAGCATT GT AAAGGAAATT GAATGGCT T T CTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
C T T GT GT AT TAAAT T AACAT T T T T AAAAC GC AGATAT T T T GTCAAGG GGCT T T GCAT T CAAACT GC T T 
T T C CAGGG C TAT ACT C AGAAGAAAGAT AAAAG T GT GAT C T AAGAAAAAG T GAT GGT T T T AGGAAAGT G 
AAAAT AT TTTTGTTTTT GT AT T T GAAGAAGAAT GAT GCAT T T T GACAAGAAAT CAT AT AT GTAT G GAT 
ATAT TTTAATAAGTAT T T GAG T AC AGAC T T T G AGGT T T CAT C AAT AT AAAT AAAAGAG C AGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACA?W\AAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TATAT TAAT AAAT TGTACATT TTT CTAATT 
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FIGURE 120 

MAT HALE I AGL FLGGVGMVGTVAVTVMPQWRVSAFI ENN I WFENFWE GLWMNCVRQANI RMQCK 
IYDSLIALSPDLQAARGLMCAASVMSFLA 

MWLI PVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGT CCTGGACACCCAAC TAC 
AAG C AG T G T T CAT GGAG T T CAT T G AAT TAT G GC AT AGAT C T T G G G AAAAT T G C GG AG T G T AC AT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGC T ATAAT TTAT TTGGACCAAGGAAGCCCTGAAATGAATT CAACAATT AATATTCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
TTGGAATGGTTCACTTAAATGACATT TTAAATAAGT TTATGTATACAT CT GAATGAAAAGCAAAG 
CTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
T AAAT AAAAAT TAT T T CC AACA 
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FIGURE 122 

MRPQGPAAS PQRLRGLLLLLLLQLPAPS S ASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFS GS LRLKCRNACCQRWY FT FNGAE C S GPL P I E AI I YLDQG SPEMNS T IN I HRT S 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSYSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T G C C T G G AAGAAT AC AT CAT G T T T T T C G AT AAGAAG AAAT T G TAG GAT C C AG TTTTTTTTT T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCC CT AAGCCAAAGCAAAAGAC CTAAGGACGACCT T TGAACAAT ACAAAGGATGGGT T TCAAT G 
TAAT TAGGC T AC TGAGCGGAT CAGC T GTAGCAC TGGTTATAGCCCCCACT GT C T TAC TGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACT TAAGTATAAT CAATT TAAAGGGCT CAACCAGCTCACCTGGCTATAC 
CT TGACCAT AACCATATCAGCAATAT T GACGAAAATGC TT T TAAT GGAATACGCAGAC T CAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGC CGCAACC T GGAACTT T TGGACCT GGGATATAAC CGGAT CC GAAGT T TAGCCAGGAAT G 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
C AT AGGACAGAC CAT GT C C TGGACC T G GAGCT C C T T AC AAAGG C T T GAT T TAT CAGG C AAT GAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGAC TCCCAGCACCCAGGAATT TTATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGC TCCAGGGAGT GT GAGGT ATGAAC CATT G TGAT AAAAAGAGC TCT TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT T T T CC TC TC AT AC ATAAT C AAC C CAT T GAAAT T TAAAT AC C ACAAT CAAT G T GAAGCT T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFWIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKIWyCESQKLQEIPSSISAGCLG 
LSLRYNS LQKLKYNQFKGLNQLT WL YLDHNH I SNI DENAFNGI RRLKE L I LS SNR I S Y FLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
IAR1WFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVWIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 13 6-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 



APP ID=10063736 



Page 278 of 322 



FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC TT T C TAA&GC T TT GGCAAGGAG TGACACCCGCCAT T TACAGACACG TAG T 
GTAT T CTGGAGGT C GAAT GGT CAC AT AT GAAC AT C T C CGAGAGGT T G T GT T T GGCAAAAGT GAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
t xh TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
P AAAACCAT TGCGAT T TCGT GGT GTACAT CAT GCAT T T GCAAAAATC T T AGCTGAAGGAGGAATAC 

0 • GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
JJ ! ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 

TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
hi TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 

01 ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
l m ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
*!f T GAGT GGAGTCAGTC CAT T T TAA 



;5 5 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
V I G GMMAG V I G Q FLAN PT DL VKVQMQME GKRKLE GK P LR FR GVHHA FAK I LAE GG I RG L WAGW VP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAJVTGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTG7VACGTGCTGGCCCTCAGCATCTGCACACGGGAAGGCTACCAGTCCATGA 
O AGGAGCGGAATGTGGACGATGGGCACAT CAT TAACAT CAATAGCAT GT CT GGCCACCGAGT GT T A 

W CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

M f w 

f I GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

'* K 5 

AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
^1 ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
111 CATC CAGAT T GGAGACAT CCAGAT GAGGCCCACGGAGCAGGTGAC CTAGTGAC TG TGGGAGC TCC 

* _ TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
?«! CACGGGAT ACCAC T T C C TGTCCACACCCCGACCAGGGGC TAGAAAAT T TG T T T GAGAT T TT TATA 

Li s 

fl TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
H! TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
U TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
• CAGAACACCAGGGCC TGGCCCAGTGGATT TCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVA^ 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHI ININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide : 
amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATCGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
C CGT T GAG TGC T TGAA ATGA GGAAC T GAG AAAAT TAAT T T C T C AT GT AT T T T TCTCAT T TAT T T A 
TTAATTTTTAACTGATAGT TGTACATAT TTGGGGGTACATGTGATAT T TGGATACATGTATACAA 
TATAT AAT GAT CAAATCAGGGTAAC T GGGAT AT CCATCACATCAAACAT T TAT T T TT TATT C T T T 
T T AGACAGAG T C TCAC TC T GT CACC CAGGCT GGAGTGCAG T GGT GCCATC TCAGC T TACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATC CAT GT TGCTGCAAAT GACAG GAT TTCGTTCT TAAT T T CAAT TAAAATAAC CACACAT G 
GCAAAAA 
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FIGURE lao 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 8 6-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lfli 



TT CT GAAGTAAC GG AAGC T ACCTT GT AT AAAGACCT CAAC ACTGCTGACC ATGATC AGCGC AGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGGAT GCT ACAACTCTCT ACC TT CAGAAC AACC AAAT AAAT AAT GCT GGGATT CCTTC AGAT 
T T GAAAAACTTGCT GAAAGT AG AAAGAAT ATACCT AT ACC AC AAC AGT TT AGAT GAAT T T CC TACC AACC T 
CC CAAAGT AT GT AAAAGAGT T ACATT TGCAAGAAAATAACAT AAGG AC TATCAC TT AT G ATT CACTT T C AA 
AAAT TC CCT ATC TGGAAG AAT T ACAT TT AG AT GACAACT CTGTCTCT G C AGTT AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CCAGGACT AT AGAAGAACTACGCT TGGATG AT AATCGC AT AT CC AC T ATT T CAT C AC C AT CT CT TC AAG 
III GT CTCACT AGT C TAAAACGC CTGGTT CT AG AT GGAAAC CT GT TGAACAAT C ATG GT T T AG GT GACAAAGT T 

III T T CT T C AACC T AGT T AAT TT GACAGAGCT GTC CCT GGT GCGGAATT CCCTGACT GCT GCACC AGTAAACCT 

f. 3 TCCAGGCACAAACCTGAGGAAGCTTT ATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 

%J CT TAT C T AAGGC AGCT CT AT CGAC TG GAT ATGT CCAAT AATAAC CT AAGT AAT T TACCT C AGGGTAT CTT T 

hj . GAT GAT TT GG AC AAT AT AAC AC AACT GAT T CT T C GC AAC AAT CCCTGGT ATTGC GGGTGC AAGATGAAAT G 

C!l GGT AC GTGAC TGGT TACAAT CACT AC CT GT G AAGGT C AACGT GCGTGGGC TC AT GT GCC AAGCC CC AG AAA 

n-t 

JS 



AGGT TC GT GGGATG GCTATT AAGG AT CT CAAT GC AGAACTGTTT GATT GT AAGG AC AGT GGGAT T GT AAGC 
ACCATT CAGATAAC CACT GC AAT ACC CAAC AC AGT GTAT CCT GC CCAAGG ACAGTG GCC AGCTCC AGT GAC 
C AAAC AGCC AGAT AT T AAGAACCCCAAGCT CACTAAGG ATCAAC AAACC AC AGGGAGT CC CTC AAGAAAAA 



!]: CAAT TACAAT T ACT GT GAAGTCTGT C ACCT CT GAT ACC ATT CAT AT CT CT TGGAAACT TGCT CT ACCT AT G 

2! ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAAT GTACAAC CC T ACAAC C ACCCT C AAT CG AG AGCAAG AG AAAGAACCTT AC AAAAAC CCCAATTT AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT C AT AGGAATG GAT C GCTC TTCT CAAGGAACT GT GCAT AT AGC AAAGGGAGGAGAAGAAAG GATG ACT AT 
GC AG AAGC TGGC AC T AAGAAGGAC AACTCT AT CCTGGAAAT C AG GGAAAC TTCT TT T CAG ATGTTACC AAT 
AAGC AATGAACCCAT C TCGAAGGAGG AGT T TGT AAT AC AC AC C ATATT TCCT CCT AAT GG AAT GAAT CTGT 
AC AAAAACAATC AC AGTGAAAGC AGT AGT AACCGAAGCT ACAGAGAC AGT GGT ATT C CAG AC T C AGAT C AC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSI FLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFI YCNDRFLTS IPTGI PEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVS AVS I EEGAFRDSNYLRLL FLS RNHLS T I PWGLPRT I EELRLDDNRI S T I S S PS L 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVS T IQI TTAI PNTVYPAQGQWPAPVTKQPD 
I KNPKLTKDQQT TGS PSRKT I T I T VKSVT S DT I H I S WKLAL PMTALRLSWLKLGHS PAFGS I TE T 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT I ALLALVCWYVHRNGS L FSRNCAYS KGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain : 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern . 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
H TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 

C TC CACT AC C TCAAGC TGAGTGAT CCCAAG TACC TAAGAGAGT T CCAGC T GACCC TCCAGCCCGG 
2J GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
jhj CCCAGGAC T CAT TC T CAGAGGAGAGAAGTGAC GTGT GCC T GGT GCAGC TGC TGGGAACGGGGACG 

%l GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
W AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
¥5 GACCACT CCAACAGAGCCAGGAC TAT AT CAACC T C T T CT GCGCCAACAT GATGGACT T GAACCGC 

AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
Iff TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
P AACAGCAGGAAGGATGC T TCGGGGAGCCT GAT GCTGAAGAT GAAGAAT T AT CTAAAGC TAT T CAA 

W TAT CAGCAGCAT TT T TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T CCAGAT TCTCGC TC TGT 

h I* 

jj! . TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 

AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
G T GGC T T C C T ATAC ATCC TGGCAGAATACC CC CCAGCAAACAGAGAGCC AC ACC CATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAA.GTTCAACTGCi\ACTGAAAAAAAAAAA 
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FIGURE 124 



MSARGRWEGGGRRACRGS LGLARAQGAERVTS SEQRPAMASLGLLLLLLLTALPPLWS S SLPGLD 
TAESKAT I ADLI LSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



Important features of the protein: 



Signal peptide: 




amino acids 1-2 6 



Transmembrane domain: 



amino acids 39-56 



Tyrosine kinase phosphorylation sites. 



amino acids 149-156, 274-282 



N-myristoylation sites. 



amino acids 10-16, 20-26, 63-69, 208-214 



Amidation site. 



amino acids 10-14 



Glycoprotein hormones beta chain signature 1 . 



amino acids 230-237 



APP ID=1 0063736 



Page 28 8 of 322 



FIGURE ias 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT TTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATAT G 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACAT GAGACGGGAAATGGAGCAG T CAAT GAATATGC T GAAT TCCAACCAT GAGTT GCC TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGC7\ACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGT CAT CCCGACGT T GAT CTCTT ACAAC T GT GTATG TT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAG G T CAAT AT T GAT G T CAC T GAAT T AAT T AC AG T G T CC TAT AGAAAAT G C CAT T AAT AAAT TAT 
AT GAAC T AC T AT ACAT TAT G TAT AT T AAT T AAAACAT C T TAAT C CAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGS YWEWS PAYRFDPVRVDI TSKGKMRARYWYIKTSE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFIMNPMVMMMVLPLLIF^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of" the protein: 
Signal sequence : 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



■rift 



$ ;j •-' 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
T GCAAGCCAAAGATGGAAACAT T GACATCAGAATC T TAAGGAGGAC T GAG TC T T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
\l CCAT CAAGAAGGACC TCCGGCTC TC TCAT GCCCACAT GACATGC CAT T GT GGGGAGGAAGCAAT G 

%ll AAGAAATACAGCCAGATTCTGAGTCAC TT T GAAAA.GCTGGAAC C TCAGGCAGCAGT T GTGAAGGC 

Cf- TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 

CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
1^ ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
Q C T T CC TT GCAT GAT T G TC T T TATGCAT CC CCAATCT TAAT TGAGACCATAC T TGTATAAGAT TT T 

f|| TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
P AT G T AT T TAT T T T T T T AC T T GG AC AT G AAAC T T T AAAAAAAT T C AC AG AT TAT AT T T AT AAC C T G 

1 tj AC TAG AG CAGG T GAT G TAT T T T TAT AC AG T AAAAAAAAAAAAC C T T G T AAAT T C TAGAAGAG T G G 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
C CAGGAATCCTACAC GGCCAGCATGTAT T T CTACAAATAAAGT T TT C T TT GCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQ FPKT S FDI S PEMS FS I YS LQVPAVPGLTCWALTAE PGWGQNKGAT TCATNSHSDSELRPE I F 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTE3LQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACG ATG CGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACC T CT CT T CAG TAAAGT T G T TAT T GT T C T GATAGAT GCCT T GAGAGAT GAT TTTGTGTTTG 
GGT C AAAGGGT GT GAAAT T TATGCCC T ACACAAC T TACCT T GT GGAAAAAGGAGCAT C T C ACAGT 
T T T G TGGCTGAAGCAAAGCCACC TACAG T TAC TAT GCC T CGAAT CAAGGCAT TGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAZ\AAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA 
TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT 
GGAT AATAATG TCAC GAGGCAT T TGGATAAAGTAT TAAAAAGAGGAGAT T G GGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGC T GAGCGAGATGGACAGCGT GC T GAT GAAGAT CCACACCT CACT GCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
CAAT GAGAGAGCAGT TGAGAT T T T TACAT T T GAATACAG T GCAGCT T AGTAAACTGT T GCAAGAG 
AAT GT GCCGT CATAT GAAAAAGAT CC TGGGT T TGAGCAGT T TAAAAT GT CAGAAAGAT TGCATGG 
GAACT GGAT CAGAC T GTACT TGGAGGAAAAGCAT T CAGAAG TCCT AT TCAACC T GGGC TCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCG.TGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGT TACCAAAAGAATAAA.CGGCAATAAT TGAGAAAAAAAA 
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FIGURE 14Q 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKmPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEWTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites . 

amino acids 6-12/ 52-58, 100-106/ 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site, 
amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTT CCAGATAAAAACTACATACGCCCAGAGATC TTC TTT GCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 

IT''** T, 

p; GTTGGGGT GACAGATAAAT T T GAG AACAG G AAACACAT T GAAT TTT CAT T T C AAC C AG TTT GC AA 

ffl AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
{\l ATT TGAAC TAAT TGTATAAAAACACCAAACCTGCTCACT 

'1 I" 
vl 

S s s 

IT 2 s 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKT THT FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNYI RPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-3 6 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 142 



C TAGAGAG TATAGGGCAGAAGGAT GGCAGATGAG T GACTCCACATC CAGAGC TGCCTCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACC ATG CTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAGT T TGCTG TCCACACAT T CAACCAACAGAGCAAGGAC TAC TAT GCC TACAGACTGGGG 
a, CACAT CT T GAAT TCC TGGAAGGAGCAGGT GGAG TCCAAGAC TGT AT TC TCAATGGAGCTAC T GC T 

Q GGGGAGAAC TAGGTG TGGGAAAT T T GAAGACGACAT T GACAACT GCCAT T T CCAAGAAAGCACAG 

Cl AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 

m 

C TCC T GAACAAGACC TGC T TGGAGGGAT T CCAC TGAGTGAAACC CACT CACAGGC TT GT CCATGT 
T% GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
g a f TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
!|1 ACACATCT TGAGCC TAAT CATGTAG T GT AGATCAT TAAACATCAGCAT TT TAAGAAAAAAAAAAA 

■ AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

«•• fm »• 
rrv'f 

*i 3 

mfmti 
•* '•. --I 

if? 

•S? 5 

u * 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHI LNSWKEQVESK TVFSMELLLGRTRCGKFEDDI DNCHFQES TELNNT FTCFFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CT GT GCAGCTC GAGGC TCCAGAGGCACAC TCCAGAGAGAGCCAAGGT T CT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT T CGGAGCCGAGGGCAACAGGTAC TACGAGGCCAACTACT GGCAG T T C CCCGAT GGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGC T TGCCAGGAGGCTGGCAGTACAGAG CGCAGCAGCGAGCAAATC CTGGCAAG TGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCT TCACAGTACT TCCCAACAACTCTTAGAGGTAG 
GTGTAT TCCCGT TT TACAGATAAGGAAACT GAGGCCCAGAGAGCT GAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CTCTAAT GAAAT TG T GAAAGCT C CAT GT T TAGAAATAAAT GAAAACACCT GA 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCS LKHCE FWLERGAGLRVTMHQPVLLCLLAL I WLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites . 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACT T T CAGAT TCAGGGGGTACAT G T GAAGGT T TGT TT TATGAGTATAT T GCATGATGC T GAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT C TCTACAT AATAGAGAATTAAGTGCAGAAAGACCTT T GAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT TGATAACTT GAACC TGCTAAAGGCAATAACAGAAAAGGAAAAAAT T GAGAAAGAAAGACAAT CTA 
^ TAAGAAGCTCCCCACTT GATAATAAGTTGAAT GTGGAAGATGTTGATT CAACCAAGAATCGAAAACTG 

m ATCGATGATTATGACTCTACTAAGAGT GGATT GGAT CATAAATTTCAAGATGATCCAGATGGTCTTCA 

|f| T CAACTAGACGGGACT CCTT TAACCGC TGAAGACAT T GT CC ATAAAAT CGGT GCCAGGATTTAT GAAG 

yj AAAAT GACAGAGC CGT GTTT GACAAGATTGTT T CTAAACTACT TAATC T CGGCCTTATCACAGAAAGC 

M CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATC TCAAAGGAAGCCAACAA 

!i .. 5 

w T TATGAGGAGGATCCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGCTGGAAAAATACCAGAGA?AG 

TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
f~| T TAACCT T GACAAAT GGCT TGGAAAGG AGAAC TAAAACCT ACAGT GAAGACAACTTT GAGGAACT CCA 

if\ ATAT TTCCCAAAT TT CT AT GCGCTACT GAAAAGTATTGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

O CACTGAT TACTATCATGAAAACACTGATT GACT TT GTGAAGAT GATGGTGAAATATGGAACAATATCT 

HI CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 

AGAAAAAAATGCTACTGACAATAT AAGCAAGCT TTT CCCAGCACCATCAGAGAAGAGTCAT GAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCT TGAAGGAT T CCACAAAA 
GAT GATAACTCCAACC CAGGAGGAAAGACAGAT GAACCCAAAGGAAAAACAGAAGCCTAT T TGGAAGC 
CAT CAGAAAAAATATTGAAT GGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTT CAT CAATAAACAAGCT GATGCTTAT GT GGAGAAAGGCAT CCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCT TAAAACACTTCTAATT CT GT GAT TAAAATTT T TT GACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAAC TAT CT GAAAGTAAAGTT GT AT GTAAGCT GAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 



N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 




N-my r i s toy 1 a ti on si tes : 



amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AATAACCAGC T TC TAGC TGGAGGGCT GCAT GCAGGGAAGGT CAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAG T GC C TGT CATG TGGGGT GG GGCAGGAGCCGAC T CTAACAC TAGAGCCAGT GAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 

GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAA.GTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGCTCGAGC T CAGAAGAT AAAAGATAAGTAGGGTATGC T GAT C CT C T TT TAAAAACCCAA 
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAA^AGTGCTCATGACATATTGAGA 
AGACCTAC TTACAAAGTGGCATATATTGCAATT TATT TTAATTAAAAGATACCTATTTATATATT 
TCTTTATAGAAAAAAGTCTGGAAGAGTT.TACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
ATT CC T T GTATAAAAATAAGAAAAGAAAT TAAT C T TGAGGTAAGCAGAGCAGACATCAT CT C TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAG7VACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGTT TCCT TGTATGAATUVGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGATGAAGGCAGAGAT CGGAGT T TT GCAGCCACAAGC TAAGAAACACCAAGGAT TG TGGCAACC 
AT C AGAAG C T T GGAAGAG G CAAAGAAGAAT TC T T CC C T AGAGG C T T T AGAG GGATAAC GGC T C T G 
C T GAAAC CT TAATC T CAGAC TT CCAGCCT CCTGAACGAAGAAAGAATAAAT TTCGGC TG T T T TAA 
GCCACCAAGGATAAT TGG T TACAGCAGC T CTAGGAAACTAATACAGC T GCTAAAATGAT CC C TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAAT AT GGCAGAAGT GATG GCAT GCCAC TTCCAAGAT TAGGT TATAAAAGACACT GCAGC TT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGT TGCT CAGT T T T GGT C TAACTT GT TATGCAGCAATAGATAAAT AATATGCAGAGAAAGAG 
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FIGURE 152 



MVLSGALC FRMKDS ALKVL YLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS LSPVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 

VRLTQLPENGGWNAPITDFYFQQCD 
N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE. 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGAC G T TCGTCTCAT T GGGGAGAAAC TGTT CCACGGAGTCAGT AT GAGTGAGCGC TGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T T GGAGAGAGT GGAGAGATCAAAGCAAT T GGAGAACTGGAT TT GCT GT T T ATGTC TC T GAGAAAT 

J3 GCCT GCAT TTGACCAGAGCAAAGCTGAAAAATGAATAAC TAACCCC CT T T CCCTGCTAGAAATAA. 

U CAAT TAGATGCCCCAAAGCGAT T T T T T T T AACCAAAAGGAAGAT GGGAAGCCAAAC T CCAT CAT G 

F ! ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 

ill 

Cl AGACCAGAAGGTAGACTT TCTAAGCATAGATAT TTAT TGATAACAT TT CATTGTAACTGGTGTT C 

|l| TATACACAGAAAACAATT TATT T T T TAAATAAT TGTC T T T T TCCAT AAAAAAGAT T AC T T T CCAT 

ffi TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 

^ . T GTAT TT AT TAT TATT AT AAGACT GCAT T T TAT T TAT AT CATT T TAT TAATATGGAT T TAT T TAT 

agaaacatcattcgatattgctacttgagtgtaaggctaatattgatatttatgacaataattat 

; s t . 

H AGAGCTATAACATGTTTATTTGACCTCAATAAACACTTGGATATCCC 

tti - 

i| ' *3» 

s 

i* m ft 
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FIGURE 154 

MAALQKSVSSFLMGTIATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTEMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGGTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACATGGATGATTCTGAGGAGGAAGCTGTTAT TGAATGTATAGAGAT T TATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLA3y[VMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide : 

amino acids 1-32 

H ; N-glycosylation site. 

5:1 amino acids 136-140 

a 1 

a. 

!Uf Tyrosine kinase phosphorylation site. 

\I amino acids 127-135 

iS.il 

N-myristoylation sites. 

amino acids 44-50, 150-156 

•* 3 ' II 

fq m 
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FIGURE 1*7 



. CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATT T GT GTGAC GGGCAAAAGCAACT TCCAGT C C TACAGC T GT GTGAGGT GC AATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCGCTGTA 
GAGCTGAACACAGT C TAT T TCAT T GGGGCCCATAATAT TCC TAAT GCAAATATGAAT GAAGATG G 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGT GT GT CAAGGCCGGAAGCC T GT GGGATCCGAACAT CAC TGC T T GTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
$\ TAT CATCGGGT T T T C T CAGGT G T T T GAGCC ACACCAGAAGAAACAAAC GCGAGC T TCAGTGGTGA 

5 jf J 

H TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

hi AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

|f! GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

s . CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 

'fit TCTACCACCACACTACTGCCGCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

TCACACAAT TTGT TACT T CAC T GAAT TT CT TCAAAAC CAT T GCAGAAG TGAGGT CATCC T TGAAA 



3 S-S 

- r« m 



ffl AGT GGCAGAAAAAGAAAATAGCAGAGAT GGGT CCAG T GCAGTGGC T T GCCAC TCAAAAGAAGGCA 

jp GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
3w GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GTGAT CT AAGAAGCCAGAT TCAT CT GCACAAATACGT GG TGGT C TAC T T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 

CTGTGCAGAACTTCTCCATGTCTUVGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAMiCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSWFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STI IGFSQVFEPHQKKQTRASWI PVTGDSEGATVQLT PY FPTCGSDC I RHKGT WLC PQTG VP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPI KVLVVYPSEICFHHTICY FTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 



Important features of the protein: 
!::!! Signal peptide: 

l£u amino acids 1-14 



Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 




N-my ri s toy lation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAAC ATGA CAG T GAAGACC C T GCATGGCCCAGCCAT GGT CAAGTAC T TGC TGCT 
GTCGAT AT TGGGGC T T GCC TT TC TGAGT GAGGCGGCAGC T CGGAAAAT CCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAA GAGGTGCATATCC 
AC T CAGCT GAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGC SVS FQLEKVL VT VGC TCVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 



% ? l N-myristoylation sites. 

amino acids 106-111, 136-141 

r± % ?• 
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FIGURE 161 

ACACTGGCCAAACAAAAACGAAAGCACT CCGTGC T GGAAGTAGGAGGAGAG T CAGGAC TCC CAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
T GCT T CGAGGC TGCCC TAGGGAGT GAGGTACGAATC TGGT CCTATAC T CAGCCCAGG TACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 

CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGC TGTACT TCAC TACCCAGCAAAGCC T CCACGAGGGCAGC T CGC C T TGGAGAGTAC T TAG TA 
C AAGACC T GCAGTCAGGCCAGT GTC T GCAGCTATGGGACGATGAC T T GGGAGCGC TATGGGCC T G 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGGCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence : 
amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGC CGGTCAGC CACCAAGAT GACT GACAGGT T CAGCT C T C TGCAGCACACTAC 
C CT CAAGCCACC TGAT GTGACCT GTATCT CCAAAG T GAGAT CGAT TCAGAT GAT T G T TCATCC TA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TCTACCACT TAGAGCTCCAGGT CAACCGCACC TACCAAAT GCACC T T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
; CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

5" TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
Cl CTACAGATATGTCAGCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

III TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

f|f AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
T T CCACAGC GGCATAGCC TGTCCGAGAT CACCTAC T TAGGGCAGC CAGACAT CTCCATCC TCCAGC 

J&H CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

M GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

|i| GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

m CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

J s . TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

P CCCTGGGGATT T GCACAGACAGAACAT CT GACCCAAATGTGCT ACACAGT GGGGAGGAAGGGACA 

|f| CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

H CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

JJI TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

'^2 GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

Cl AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

111 ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGT GCAAGGAG GAAATGCAGGGAAAC TCCCGAGGT CCAGAGCCC CACC TC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT CTAGCTCGACAGAGCATGAGGCCCCTGCCTCT TCTGTCAT TGT TCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGGCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGT C CACCAGGGAGGGAGACACACAAGC C T T T T CT GCAGGCAGGAGT T T CAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CT CACCACT CACAAGC TGTGTGACTTCAAACAAATGAAATCAGT GCCCAGAACC TCGGT TTCC TC 
ATCT GT AAT GT GGGGATCATAACACC TACC T CAT GGAG T T GT GGT GAAGAT GAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
S PS DQGPS PWGLLES LVCPKDEAKS PAPET SDLEQP TELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain . 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



APP ID=10063736 



Page 318 of 322 



FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAZW^GTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGG TGAC GATGTATATG TCTAC TACAAAC TGGCAGGGGGATCC CT T GAAC TT T GGGC TGGA 
AGT GT T GAACACAGT T T TGGATAT T T TC CAAAAGAT T TGATCAAGGT ACT T CATAAATACACGGA 
AGAAGAGC TACATAT TCCAGCAGATGAGACAGACT T TGTC TGCT TT GAAGGAGGAAGAGAT GAT T 
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCGT TATT 
CAT TACAGCAAAGGATTTCGTT GGCATCAAAATCTAAGTT TGTTTTACAAAGAT TGTT TT TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKMjEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT T GAGAAT GAAGCAGAACAGAAGT TAAT AGAGAGCATGT T GCAAAACC T GACAAAACC CGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
Q ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
fll CTT GGGGGTCCC TACCT T T AC CAGT GGAAT GAT GACAGGTGTAACAT GAAGCACAAT TATAT TT G 

CAAG TATGAACCAGAGAT TAAT CCAACAGCCC CTGTAGAAAAGCCT TAT CT TACAAATCAACCAG 
Si j[ GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
III ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 

GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGT GGAT T TCAAAGAGTA 
s,J CCAGAAAAGAAAGTGGCATGGAAGT ATAA T AAC TCAT TGAC TT GGT T CCAGAAT T TTGTAAT TCT 

2? GGAT CT GT ATAAGGAAT GGCAT CAGAACAATAGC T T GGAATGGC T T GAAATC ACAAAGGATC TGC 

y * AAGAT GAAC T G TAAGC TC CCCC T T GAGGC AAAT AT T AAAGT AAT T T T TATAT G T CTAT TAT T TC A 

8 TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
f J ACT T CAAAC T T CAAGCAAATGAAAT GGACAAT GCAGATAAAGT T GT TATCAACACGT CGGGAGT A 

jH TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
%! . T GT AT AT TGTAT TGAAAT T TACAGT GTGCAAAAGTAT TT TACCT TTGCATAAGTGT TTGATAAAA 

U ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
|0 AAACTTATTACTGT TGTCAACTGAAT TCACACACACACAAATATAGTACCATAGAAAAAGT TTGT 

:r! TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
fjl AATAAGAAGC TATT TCAT TAAG T GT GATATAAACC TCC T CAAACAT T T T AC T TAGAGGCAAGGAT 

3ri TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 

T T G TAAT AAT G T AAC T T T G T TAAT AG G T G C AT AAACAC TAAT G C AG T C AAT T T G AACAAAAG AAG 
T G AC AT AC AC AAT AT AAAT CAT AT G T C T T C AC AC G T T GC C TAT AT AAT G AG AAG C AGC T C T C T G A 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T T AGGAAGGAAAGGAAC TACGAAAT CGT GT GAAAAT GGG T TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 

YRNWYTDEPSCGSEKCVVMYHQPTMIPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 

PYLTNQPGDTHQNVVVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites . 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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